THE SIZE-DISTANCE INVARIANCE HYPOTHESIS


F. P. KILPATRICK AND W. H. ITTELSON 


Princeton University 


Extremely common in the literature 
of visual perception are explanations or 
descriptions of the relationship between 
size, distance, and visual angle which 
resort to the use of mathematical ex- 
pressions which are special cases of the 
proportions existing in similar triangles. 
Such applications of Euclidean plane 
geometry to psychological relationships, 
with their consequent, often uncritical, 
mixture of physical and psychological 
variables, offer attractively simple solu- 
tions. However, the value of these 
solutions rests at bottom on the degree 
to which they actually are consistent 
with the observed relationships. Con- 
sequently, a careful scrutiny in rela- 
tion to experimental evidence of the 
hypothesis from which such mathemati- 
cal expressions are derived would seem 
to be in order. 

The basic expression follows directly 
from an elementary theorem of plane 
geometry. The proportions between 
corresponding parts of similar triangles 
shown in Fig. 1 can be expressed as 





It follows directly from this that
specifying α (Fig. 2) determines the
relationship between a, b, and β. Specifically,
this relationship is given by

$$\tan \alpha = \frac{a \sin \beta}{b - a \cos \beta} \qquad [1]$$

Fig. 2.


We can apply this simple geometry 
to physical objects in physical space, if 
we confine ourselves to sizes and dis- 
tances which are neither too small nor 
too large. 

Let S be the physical size of an object
(Fig. 3) at a physical distance D
from some reference point O, let α be
the angle subtended by S at the point
O, and let δ be the deviation of S from
the vertical, or the inclination of S.
Equation [1] becomes

$$\tan \alpha = \frac{S \cos \delta}{D - S \sin \delta} \qquad [2]$$

Fig. 3.


If we limit ourselves to angles sufficiently
small so that tan α = α, and consider
as a special case the condition
δ = 0 (Fig. 4), equation [2] reduces to

$$\alpha = \frac{S}{D} \qquad [3]$$

Fig. 4.


In this paper we shall consider the
implications only of equation [3], bearing
in mind at all times that it is only
a special case of equation [2], which is 
in turn a special case of the three- 
dimensional relationship, including the 
most general case in which S is a curved 
surface. 

From equation [3] certain simple 
conclusions may be drawn. 


a. When the angle is given, size and 
distance vary proportionally. 

b. When angle and distance are
given, size is uniquely defined.

c. When angle and size are given,
distance is uniquely defined.

d. When size is given, angle and dis- 
tance vary inversely. 

e. When distance is given, angle and 
size vary proportionally. 
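
The geometry above can be checked numerically. The following is a minimal sketch and not part of the original paper; the function names `visual_angle` and `small_angle` and the sample sizes and distances are illustrative assumptions. It evaluates equation [2] and compares it with the small-angle form [3].

```python
import math

def visual_angle(size, distance, inclination=0.0):
    """Equation [2]: angle (radians) subtended at O by an object of
    physical size S at physical distance D, inclined by delta radians."""
    return math.atan2(size * math.cos(inclination),
                      distance - size * math.sin(inclination))

def small_angle(size, distance):
    """Equation [3]: small-angle special case with zero inclination."""
    return size / distance

# Illustrative values only (any consistent length unit).
exact = visual_angle(1.0, 40.0)
approx = small_angle(1.0, 40.0)

# Conclusion (a): doubling size and distance leaves the angle unchanged.
same_angle = math.isclose(visual_angle(2.0, 80.0), exact)

# Conclusion (d): for a fixed size, doubling distance halves the angle
# under the small-angle form.
halved = math.isclose(small_angle(1.0, 80.0), approx / 2)
```

At an angle this small, `exact` and `approx` differ only slightly, which is the condition under which equation [3] may stand in for equation [2].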


This simple geometry of the right 
triangle is of interest to the student of 
space perception because the apparent 
sizes and apparent distances of objects 
seem to conform to similar or identical 
rules. Indeed, in many cases equation 
[3] seems quite accurately to describe 
the relationship between apparent size, 
apparent distance, and visual angle. 
This fact has led some psychologists to 
apply this equation to the description 
of psychological events. In many in- 
stances, the resulting equations in- 
judiciously mingle physical and psycho- 
logical quantities. An example is 


$$\text{visual angle} = \frac{\text{apparent size}}{\text{physical distance}}$$

Another familiar example which leans
more heavily on psychological quantities
is

$$\text{retinal size} = K \cdot \frac{\text{apparent size}}{\text{apparent distance}}$$


However stated, one assumption is 
central to all such applications of sim- 
ple geometry to perceived space: a reti- 
nal projection or visual angle of given 
size determines a unique ratio of ap- 
parent size to apparent distance. This 
is the Size-Distance Invariance Hy- 
pothesis. 

This hypothesis has been utilized by 
psychologists in two ways. First, as 
explanation, some form of invariance 
hypothesis is central to many theories 
of the perceptual constancies. As thus 
used, the most general statement of the 
invariance hypothesis holds that con- 
stancy depends on the existence of two 
processes which vary reciprocally so 
that the resultant product always remains
the same. This type of explanation
as related to the size-distance
problem was first proposed by Wheat- 
stone; it is central in the Gestalt theory 
of constancy, implied in much of Bor- 
ing’s writing, and again crucial and 
explicit in Gibson’s theory, to mention 
a few important examples (cf. the dis- 
cussion in [11]). Also, it has found 
expression in recent issues of this jour- 
nal in the writings of Schlosberg (16), 
who employed it to explain experiments 
in the visual perception of size and dis- 
tance, including a number of the Ames 
demonstrations, and of Gilinsky (8),
who used it as a basis for a derivation
and quantitative formulation of “a general
unifying law of visual space perception
. . .” (p. 475).

In this paper we shall not be directly 
concerned with the use of the invari- 
ance hypothesis as an explanatory con- 
cept in perceptual theorizing, but rather 
with its second major use, that of description.
We choose to concentrate on
this latter aspect because it is more
fundamental; the adequacy of a formulation
as explanation depends directly
on its adequacy as description. Is the
invariance hypothesis adequate to de- 
scribe the observed relationships be- 
tween apparent size, apparent distance, 
and visual angle? The existing evi- 
dence will be examined and a new ex- 
periment reported in an effort to answer 
this question. 

That the size-distance invariance hy- 
pothesis does describe this relationship 
in many instances is well attested. No 
attempt will be made here systemati- 
cally to survey the literature on this 
subject but rather simply to indicate 
the general nature of the supporting 
evidence. Three cases may profitably 
be distinguished. 

a. Visual angle constant. The in- 
variance hypothesis demands that ap- 
parent size and apparent distance vary 
proportionally, and indeed this is the
most commonly reported result. Duke-
Elder has summarized current evidence
of this case by noting that “If we esti- 
mate the distance to be much greater 
than it is, we consider that if the ob- 
ject at this distance is so big, it must be 
very big indeed. Conversely, if an ob- 
ject seems nearer than it is, it seems 
to be smaller than it actually is” (6, 
p. 1071). This case is also the one 
with which Emmert’s law is concerned, 
whose usual interpretation is in support 
of the invariance hypothesis. 

b. Apparent distance constant. The 
invariance hypothesis demands that ap- 
parent size vary directly with visual 
angle, and again this is the most com- 
monly reported result. This case is, in 
fact, closely related to the finding of 
most size-constancy work, that the 
proper estimation of size is dependent 
upon the proper estimation of distance. 

c. Apparent size constant. The in- 
variance hypothesis demands that ap- 
parent distance vary inversely with 
visual angle. This general finding has 
been repeatedly verified in a variety of 
cases. 

It is clear, then, that considerable ex- 
perimental evidence can be mustered in 
support of the invariance hypothesis, 
and indeed most writers on the problem 
of the relation between apparent size 
and apparent distance would seem to be 
in general agreement with this hypothe- 
sis. The prevailing view can be sum- 
marized in Sanford’s words written half 
a century ago, “. . . size and distance 
are mutually determining. If the apparent
distance is constant, the apparent
size of the objects changes directly
with the size of the retinal image; while 
if the apparent size is constant, the 
apparent distance changes inversely 
with the image. These are facts of very 
common observation” (15, p. 198). 

Possibly the case could be allowed to 
rest there, were it not for the fact that 
there is also considerable experimental
evidence in support of the proposition
that no aspect of perception can be a 
simple and invariant function of a few 
variables. This proposition is derived 
from the more general theory that the 
perceptual process is one which always 
and necessarily includes all aspects of 
the functioning of the organism (1, 4, 
5, 11, 12, 13). To find that the size- 
distance relationship is an invariant 
one as so commonly reported would 
constitute an important exception to 
that theory. Consequently, it would ap- 
pear to be of considerable systematic 
importance to note that the findings 
with respect to the invariance hypothe- 
sis are not entirely one-sided. Al- 
though the contradictory evidence is 
scattered and possibly not conclusive, 
it stems from a variety of experimental 
approaches. 

a. Varying accommodation while looking
at a fixed object. The relation between
accommodation and apparent
size under these conditions is well established.
Anyone can verify for himself
the truth of Hering’s report that
“when changing from a monocular fixation
of a distant object to viewing one
close by, the distant object appears to
diminish in size. Conversely the size
of the nearer object increases when ac- 
commodating for the farther” (10, p. 
172). The relation of this effect to ap- 
parent distance is by no means so 
clearly established. This problem has 
never been systematically investigated, 
but it has been mentioned by many 
writers, who, although they are not in 
complete agreement, would seem to 
support von Kries in stating: 


When we gaze at any object with one eye 
screened, it is easy to notice that its ap- 
parent size varies with the state of accom- 
modation. Every exertion of accommoda- 
tion is accompanied by an apparent reduc- 
tion in size, and every relaxation by an 
apparent magnification. The simplest explanation
of this well-known and easily observed
phenomenon would be to suppose
that as the result of accommodation for 
near vision the object appears to be at the 
distance that ordinarily corresponds to this 
accommodation, that is, too near; and 
hence, as long as it subtended the same 
visual angle, its absolute size would appear 
to be less than it was. The trouble about 
this explanation is that, as Donders long 
ago rightly pointed out as something re- 
markable, this is not what actually hap- 
pens. The object does appear to become 
smaller when accommodation is exerted, 
but it by no means appears to come nearer 
at the same time; on the contrary, it ap- 
pears to recede. Thus, with a constant 
visual angle, we see here apparent size and 
apparent distance vary in the opposite sense 
(17, p. 389). 


This is clearly and explicitly in con- 
tradiction to the invariance hypothesis. 

It should, however, be pointed out 
that this evidence, as well as that in 
the following section, is complicated by 
the possibility that the physical size of 
the retinal projection varies with 
changes of accommodation. (See, for 
example, discussions in Neumueller 
[14] and Duke-Elder [6]). 

b. Accommodative micropsia due to 
partial paralysis of accommodation. 
Partial paralysis of accommodation, 
most commonly by means of atropine, 
is accompanied by a reduction in ap- 
parent size, or micropsia. While this 
fact is well established, the relation to 
apparent distance again is not com- 
pletely clear. Duke-Elder implicitly 
attributes this effect to the invariance 
hypothesis when he writes that “. . . in 
partial paralysis of accommodation . . .
we volitionally expend a greater effort 
to see an object distinctly, . . . and we 
thus imagine the object to be quite 
near; but since the retinal image is of 
the same size, we think the object must 
be smaller than it is” (6, p. 1072). 
Hering, however, reported findings of 
Aubert which specifically refute this 
explanation. “Aubert after instillation
of 1/500 grain atropine, saw letters
with the atropinized eye at a distance 
of 20 feet, apparently half the size as 
seen with the other. . . . In spite of this, 
strange to say, the apparently smaller 
objects did not seem nearer but farther 
than in the sound eye” (10, p. 172). 
This report is clearly in contradiction 
to the invariance hypothesis. 

c. Visual tau effect. “If three dis- 
crete points of light are successively 
presented, and the distance between the 
points of light is equal, but the time 
interval between the first and second 
points of light is greater (lesser) than 
the time interval between the second 
and third, then the perceived space be- 
tween the first and second will be 
greater (lesser) than the perceived 
space between the second and third 
points of light” (7, p. 483). Contrary 
to what one might expect in terms of 
the invariance hypothesis, there is no 
evidence that this well-attested phe- 
nomenon is accompanied by any appar- 
ent distance change. Of course, it might 
be argued that the perceived space be- 
tween the successively presented lights 
is not “size” in the usual sense, as the 
lights which set the “space limits” do 
not stimulate the retina simultaneously. 
Nevertheless, pending further experi- 
mentation, the visual tau effect might 
reasonably be considered an exception 
to the invariance hypothesis. 

d. Subjective factors. Some recent 
evidence seems to indicate that the ap- 
parent size of an object may be influ- 
enced by subjective factors such as the 
importance of the object to the observer 
(cf. discussions in [2,3]). While such 
evidence is as yet inconclusive, if it 
should be substantiated, it would con- 
stitute an important exception to the 
invariance hypothesis. 

e. Statistical considerations. Much 
if not all of the evidence which tends to 
support the invariance hypothesis is 
statistical in nature. That is, it states
that the invariance hypothesis describes
the mean of a large number of observa- 
tions in a large number of cases. But 
as usually expressed, the invariance 
hypothesis is not a statistical state- 
ment; it is an equation which, within 
the limits of accuracy of measurement, 
must apply to the individual case. 
Therefore, even if the invariance hy- 
pothesis always adequately described 
the mean performance (which, as we 
have seen, it probably does not), it 
would still have to be modified drasti- 
cally before it would be acceptable as 
a description of psychological events. 
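
A small numerical sketch may make this distinction concrete. The figures below are invented purely for illustration and are not data from any experiment; the variable names are likewise hypothetical. They show a set of individual size-to-distance ratios whose mean equals the value the invariance hypothesis predicts even though no single observation does.

```python
from statistics import mean

# Hypothetical visual angle (radians) held constant across observations.
visual_angle = 0.05

# Invented (apparent size, apparent distance) reports, arbitrary units.
reports = [(0.4, 10.0), (0.6, 10.0), (0.3, 10.0), (0.7, 10.0)]

ratios = [size / distance for size, distance in reports]

# The mean ratio matches the prediction of the hypothesis...
mean_fits = abs(mean(ratios) - visual_angle) < 1e-9

# ...yet each individual report departs from it.
individual_fits = [abs(r - visual_angle) < 1e-9 for r in ratios]
```

Agreement of the mean with the equation says nothing about whether the equation holds for any one observer on any one trial.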

It is clear, then, that there exists a 
body of evidence which, although it ap- 
proaches the problem obliquely, is yet 
relevant and contradictory to the in- 
variance hypothesis. The evidence at 
the very least suggests that this hy- 
pothesis may not be adequate to de- 
scribe the relationship between appar- 
ent size, apparent distance, and visual 
angle in all cases. There is need, never- 
theless, for an experiment specifically 
designed to test the applicability and 
generality of the invariance hypothesis. 
In the experiment reported below visual 
angle was kept constant, apparent dis- 
tance was influenced by over-lay indi- 
cations, and apparent size by the size 
of the background. Reports were ob- 
tained of changes in apparent size and 
apparent distance and were analyzed 
as supporting or contradicting the in- 
variance hypothesis. 


DESCRIPTION OF THE EXPERIMENT¹


The apparatus employed has been 
described and illustrated by the au- 
thors in a previous publication (13). 
Briefly, it consisted of a trapezoidal
shape cut out of ¼-in. aluminum, 39 in.
long, 47¼ in. and 25 in. high at the
long and the short ends, respectively.
Three rows of five openings somewhat
like the panes in a window were cut out
of the trapezoidal shape, and on it were
painted shadows like those commonly
seen on an ordinary window.

¹ The authors are indebted to Charles T.
Albert for collecting the data in this experiment.

This trapezoid was suspended from 
the ceiling by two wires so that the 
sides were vertical and the center of 
the frame 46 in. above the floor. A 
black thread passed through the middle 
opening of the shorter side and was at- 
tached to four pulleys so that the 
thread was horizontal to the floor, 48 
in. above the floor, and at right angles 
to the line of sight. A motor attached 
to one of the pulleys drove this thread 
at the rate of about half an inch per 
second. This apparatus was viewed
from a distance of 10 ft. utilizing a
headrest which permitted monocular 
observation only, and which was adjust- 
able so that eye level could be main- 
tained at 46 in. above the floor. 

Black drapes were hung behind the 
trapezoid and between the trapezoid
and the observer. In this latter drape,
4 ft. from the viewing point, a rectan- 
gular opening was cut through which 
the trapezoid could be observed. Be- 
hind this drape two 15-w. lights in 
photographic reflectors were mounted 
so that they gave balanced illumination 
to both sides of the trapezoid. With 
the exception of these two lights the 
room was completely dark. 

Two objects were provided which 
could be attached to the black thread 
and thereby moved through the trapezoid.
These objects were an ordinary
playing card (three of diamonds, 2¼
in. × 3½ in.) and a piece of cotton approximately
2 in. in diameter.

Observers were twenty-four under- 
graduates at Princeton University, none 
of whom had any previous knowledge 
of the nature or purpose of this experi- 
ment. All of them had, or were cor- 
rected to, 20/20 visual acuity. 

The trapezoid described above appears
to O to be a rectangle with the
actually far edge of the trapezoid appearing
as the near edge of the rectangle
and vice versa. Under these conditions,
objects carried through the trapezoid
in a straight path by the moving
thread appear typically to move through
an “S”-shaped path in the horizontal
plane (see 13). The problem, as related
to the study of the invariance hypothesis,
is to determine whether this
change in apparent distance as the ob- 
ject moves is accompanied by the cor- 
responding change in apparent size 
which would be predicted from the in- 
variance hypothesis. In order to secure 
this information, each O was presented 
with three different situations. 

1. Trapezoid alone. The O viewed 
the trapezoid and was asked to describe 
its orientation in space and its approxi- 
mate dimensions. 

2. Trapezoid with moving objects. 
Each of the two objects described above 
was viewed moving through the trape- 
zoid from right to left and also from 
left to right. The order of presentation 
was randomized. The O viewed each 
condition twice and was asked to de- 
scribe any changes in apparent size or 
distance of the moving object. 

3. Trapezoid with stationary objects. 
Two playing cards were suspended from 
the stationary wire with their inner 
edges lined up from the viewing point 
with the outside edges of the trapezoid. 


FINDINGS 


1. Trapezoid alone. All Os de- 
scribed the trapezoid as a rectangle 
tipped sharply sideways with its left 
edge near and its right edge far. Estimates
of its dimensions ranged between
1½ ft. × 2 ft. and 5 ft. × 3 ft.,
with 18 out of the 24 Os estimating the
horizontal dimension to be at least 3 ft.

2. Trapezoid with moving objects. 
In 75 of the 96 trials, definite movement
in depth of the test object was
reported, 3 ft. or more in 39 of the
trials and less than 2 ft. in only 12 of 
the trials. Perceived movement in 
depth was about equally common for 
the two test objects; in the case of the 
playing card, such movement was seen 
in 36 out of 48 trials, while with the 
cotton it was seen in 39 out of 48 trials. 
No significant effect of the right-left or 
left-right orders appeared for either 
test object, although depth movement 
was reported a little more often for 
both objects in the right-left trials. 
These depth changes, when seen, were 
quite consistent in length throughout 
the trials of an individual, and, in gen- 
eral, corresponded to O’s earlier esti- 
mate of the horizontal dimension of the 
frame. There can be little doubt, then, 
that most Os in the majority of their 
trials did see the test object change dis- 
tance, and that these changes were of 
sufficient magnitude relative to the 
viewing distance of 10 ft. that a cor- 
responding change in size should have 
been perceived in all such cases if the 
invariance hypothesis were to be sub- 
stantiated. 
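
The size change that the invariance hypothesis would require can be estimated with a short calculation. This is a sketch only: the function name is hypothetical, and the assumption that the depth excursion is centred on the 10-ft. viewing distance is ours, not the paper's.

```python
def predicted_size_ratio(near_distance, far_distance):
    """Under the invariance hypothesis with constant visual angle,
    apparent size is proportional to apparent distance, so the ratio of
    apparent sizes equals the ratio of apparent distances."""
    return far_distance / near_distance

viewing_distance_ft = 10.0   # from the apparatus description
depth_change_ft = 3.0        # a reported depth excursion

# Assumption: the excursion is centred on the viewing distance.
near = viewing_distance_ft - depth_change_ft / 2
far = viewing_distance_ft + depth_change_ft / 2

ratio = predicted_size_ratio(near, far)
percent_change = (ratio - 1.0) * 100.0
```

On these assumptions a 3-ft. change in apparent distance would call for a change in apparent size of roughly 35 per cent, which an observer could hardly fail to notice.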

a. Playing card. In only 14 of the 
37 trials where either size or distance 
change was seen could the Os’ reports 
be interpreted to be in line with the 
invariance hypothesis. These 14 re- 
ports were that when the card ap- 
proached, it became smaller, and when 
it moved away, it became larger. In 
the remaining 23 trials, three quite dif- 
ferent size-distance perceptions were re- 
ported, all of them distinguished by 
their lack of accord with the invariance 
hypothesis. The most common of these 
deviant observations, reported in 15
trials, was that a marked change in distance
was seen without any apparent
alteration in size. Another report, occurring
in 7 instances, was exactly the
reverse of the invariance-hypothesis
formulation; when the card approached
it became larger, when it receded it
became smaller. In the remaining instance
no change in distance was reported
but the O said he saw the card
change size.

TABLE 1

REPORTED SIZE-DISTANCE PERCEPTIONS OF
THE MOVING OBJECTS

(24 observers, 4 trials each)

                                                  Number of Trials
Response Category                               Card | Cotton | Total

No change in size or distance                    11  |    9   |  20
Changed in accord with invariance hypothesis     14  |   15   |  29
Changed, but not in accord with
  invariance hypothesis*                         23  |   24   |  47
Total trials                                     48  |   48   |  96

* This category includes three types of size-distance
relationships: (a) change of size and not distance, (b)
change of distance and not size, and (c) decrease of
size with increase in distance or vice versa.

b. Cotton. The results with the 
moving piece of cotton were substan- 
tially the same as those reported for 
the playing card. The numerical dis- 
tribution of the findings is reported in 
the second column of Table 1. 
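
The counts given in the text and in Table 1 can be cross-checked with a few lines of arithmetic. The sketch below only re-adds figures already reported; the dictionary keys are labels of our own choosing.

```python
# Counts transcribed from the text and Table 1.
card = {"no_change": 11, "in_accord": 14, "not_in_accord": 23}
cotton = {"no_change": 9, "in_accord": 15, "not_in_accord": 24}

# Breakdown of the 23 deviant card trials given in the text.
card_deviant = {"distance_only": 15, "reversed": 7, "size_only": 1}

# Column totals across the two test objects.
totals = {k: card[k] + cotton[k] for k in card}

# Card trials in which either size or distance change was seen.
card_changed = card["in_accord"] + card["not_in_accord"]

# Share of all trials that agreed with the invariance hypothesis.
share_in_accord = totals["in_accord"] / sum(totals.values())
```

The tallies are internally consistent, and fewer than a third of all trials fall in the category that the invariance hypothesis predicts.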

3. Trapezoid with stationary objects.
Here, also, with the visual angles subtended
by the two playing cards the
same, there is no invariant relationship
between reported apparent size and distance.
The modal category of response
(10 Os) is in line with the invariance
hypothesis, but almost an equal number
of Os (9 cases) reported perceptions
which could not be so interpreted. The
results of this part of the experiment
are summarized in Table 2.

TABLE 2

REPORTED SIZE-DISTANCE PERCEPTIONS IN
THE STATIC SITUATION, RIGHT-HAND
CARD STATED IN RELATION TO LEFT

(24 observers, 1 trial each)

Response Category                               Number of Trials

Same size and distance                                  5
In accord with invariance hypothesis:
  Larger and farther                                   10
Not in accord with invariance hypothesis:
  Larger and nearer
  Larger and same distance
  Vacillating between “larger and farther”
    and “larger and same distance”
Total trials                                           24


DISCUSSION 


The size-distance invariance hypoth- 
esis states that for a given visual angle 
there is a unique and constant ratio of 
apparent size to apparent distance. 
The results reported above, together 
with the earlier evidence, clearly indi- 
cate that this hypothesis is not ade- 
quate to describe this relationship in 
all cases. The invariance hypothesis 
therefore loses its status as a powerful 
explanatory concept and becomes rather 
a description of results obtained under
conditions which have yet completely 
to be specified. 

This conclusion certainly should not 
be surprising. The invariance hypothe- 
sis is commonly expressed in the form 


$$\text{visual angle} = \frac{\text{apparent size}}{\text{apparent distance}} \qquad [4]$$


or some equation reducible to this. 
Such equations represent an attempt to 
take a relation derived under the 
axioms of Euclidean plane geometry 
and to transfer it bodily to the descrip- 
tion of psychological events. There is 
no a priori reason why this should be 
possible. 

When applied to physical objects in 
physical space, and within certain limit- 
ing conditions, the equation 


$$\text{visual angle} = \frac{\text{physical size}}{\text{physical distance}} \qquad [5]$$


always holds. This equation is useful 
to the psychologist because it defines 
the physical limits of the experience of
the organism. Whenever the retina has
been stimulated along a length describable
by the visual angle α, this has always
been produced by physical objects
at physical distances uniquely
specified by the above equation (or 
more generally by equation [2]). There 
have never been any exceptions to this 
rule; there never will be. When equa- 
tion [4] adequately describes the re- 
ported perceptions, this simply means 
that the perceptions most nearly ap- 
proximate the physical situations which 
have been related to all the size-distance 
experience of the organism, and which 
can be summarized by equation [5]. 

When behavior of this type is en- 
countered, it can be labeled invariance 
behavior. The conditions under which 
invariance behavior can be observed 
have yet to be determined, but should 
provide a more satisfactory base line 
for evaluating experimental results than 
the usual criterion of constancy. It is, 
so to speak, what one would expect the 
organism to do if there were no reason 
for doing otherwise.

This paper is intended to further
the analysis that has already been par- 
tially developed (6). It is now hoped 
to suggest steps of a more positive na- 
ture designed to develop adequate lan- 
guage systems for the discussion of be- 
havior theories. It seems to the writer 
that two of the most pressing needs in 
modern behavior theory are: (a) The 
development of a more precise language 
for the discussion of the theory, since 
much of existing behavior theory is 
vitiated by an apparent inability to 
formulate sensible questions. The ques- 
tions asked prior to experiment, for 
example, or those which form the basis 
of theory construction, are, all too 
often, either meaningless, circular, or 
insufficiently precise, and this is partly, 
at least, a function of the language 
used. (b) The development of sys- 
tematic theories of behavior in molecu- 
lar terms, as well as in molar terms, ap- 
pears necessary to remove the vague- 
ness that surrounds molar constructs. 
This vagueness can be greatly reduced 
by breaking the constructs down to a 
molecular level. This paper is wholly 
concerned with finding a solution to (a). 


THE DEVELOPMENT OF FORMALIZATION


“Formalization brings out clearly to
what a large extent problems in science
are not empirical questions but questions
of linguistic convention. This
also helps to eliminate useless controversy.”
This quotation from a memorandum
by Woodger¹ forms the basis
for this discussion. Formalization is,
broadly, the process of reducing our
linguistic systems to more precise logical
forms: axioms, definitions, and so
on. In this paper it is intended to indicate
the desirable development which
formalization of behavior theory should
take. This approach, it is hoped, will
especially be brought to the notice of
those narrow experimentalists who repeatedly
call for experiment, and decry
theory, and continue to stumble through
the maze of science erroneously believing
themselves to be dealing wholly
with facts, and never with linguistics.

¹ Unpublished manuscript.

It is apparent that there are many 
symbolic logics of various forms, some 
of which have already been used in the 
psychological field, so far with some- 
what limited success. Such systems 
may be premature in a subject which is 
not clearly developed as a science. 
Their compact shorthand, although in 
some form certainly desirable as an 
ultimate aim, needs to be translated back 
into longhand for many purposes. Thus 
our aim is a rigorization of language, 
or the setting up of a compromise be- 
tween rigid postulational systems of a 
narrow kind, such as the propositiqnal 
calculus, and the vague, inadequate 
language of ordinary discussion. The
principal trouble in existing psychological
languages is that either (a) they
sprawl, introduce enormous numbers of 
new terms, and are vague and lend 
themselves to meaningless formulations, 
or (b) they are impressively precise
and impose a rigidity on our knowl- 
edge of behavior that leads to an arbi- 
trariness which allows of no real pre- 
diction. More precisely, (a) they allow 
no prediction which is precise, or (b)
allow precise prediction which is almost 
wholly false to facts.

Ideally it seems that we should have
a hierarchy of languages which moves 
from the very general to the extremely 
particular and allows us to suit the 
precision of our language to the preci- 
sion desired in the discussion. We talk 
vaguely in ordinary conversation, but 
become more precise as the dimensions 
of our topic become more precise, and 
ultimately we are forced back, as is 
necessary, to definitions and even to the 
assumptions underlying our theoretical 
system. It is certainly true that this 
last and highest approximation, to 
which we refer in cases of obscurity, 
could well be in mathematical terms, or 
some such rigorous form. This paper, 
then, will attempt to suggest the way 
we may fill the gap between our ordi- 
nary language and our logical calculi. 

In an earlier paper (6) the vague- 
ness accruing at certain points even in 
rigorous theory, i.e., in the use of logi- 
cal constructs, undefined terms, philo- 
sophical directives, many-meaninged 
terms, etc. was noticed. This should be 
borne in mind throughout this paper 
and will be referred to from time to 
time. 


THEORY CONSTRUCTION 


In science, the search for adequate 
methods of theory construction con- 
tinues, and psychological theory is es- 
pecially interested in deriving some 
relatively precise methods. In this, 
and subsequent paragraphs, it is in- 
tended to outline some of the problems 
and one way in which they might be 
solved. 

In the first instance we may choose 
to build our models in at least one of 
two ways. We start from our experi- 
ence in either case, but we may find it 
convenient to assume the existence of a 
real external world behind experience, 
which needs to be observed and mapped 
on various levels of language. This is 
what Pragmatic Realism aims to do
(5, 15). We may, on the other hand,
start from experience, without making 
any assumptions about a real external 
world; thus our theoretical model comes 
to consist of observational propositions, 
and our vague and difficult formula- 
tions will be defined operationally. 
This last method is generally referred 
to as some brand of Phenomenalism, 
Positivism, or Operationism (1, 2, 3). 
I would submit that there is no ob- 
vious reason for not accepting a mixed 
Operational-Realist system wherein we 
may regard the Realistic model as being 
generally adequate, and the use of an 
operational definition as a form of 
limiting test where disputes, difficulties, 
etc. occur—generally at the borders of 
our knowledge at any given time. The 
ability to use such operational defini- 
tions is a guide to the degree of validity 
of the theory. The question of setting 
up criteria for deciding what consti- 
tutes a valid operation is a matter for 
convention, as is the problem of 
“observation-in-principle.” To recapitulate,
we are asserting that concepts,
or formulations, need not, in general, 
be defined in a strictly operational 
form, but that we may conveniently 
use a realistic model subject to an op- 
erational test, wherever possible, and 
then define our terms in a manner we 
shall call a contextual definition. 
There are now further problems in 
theory construction to be settled. 


“INTENSIONAL” AND “EXTENSIONAL” 
MODELS


The question now to be considered is 
the actual method of construction that 
is best suited to behavior theory. Let 
us first consider the classical postulational
method which has proved to be
so successful in mathematics and 
mathematical science. The theories of 
Groups, Matrices, Euclidean Geometry, 
and Boolean Algebra are well-known 
examples of postulational systems. (It 
will be noted that these markedly successful
cases are essentially analytic,
or conceptual systems.) In psychology 
Hull has, of course, used a kind of 
postulational method in the statement 
of his theory of behavior (9). 

The postulational technique itself is 
not the question which I primarily wish 
to discuss, as I intend to assume that 
in some form this is essential. The 
question is rather: Is the intensionally 
defined system adequate? To put it 
in other words, we may start from a 
set of undefined terms and from their 
derived postulates, and then introduce 
the various conceptual terms such as 
learning, perception, motivation, habit, 
habituation, adaptation, reinforcement, 
and so on. These terms are not necessarily
all to be included in the theory,
but some such terms, many-meaninged 
in character, must certainly arise. The 
many-meaninged, or multiordinal (hav- 
ing a different meaning on each level of 
language) types of term must, and will,
arise in scientific theory; and they
present the central problem. Now 
again to put the question: are our defi- 
nitions to be intensional?—i.e., is a 
class definition adequate? May we, for 
example, talk of the class of reinforcers, 
as does Meehl (14)? The answer ap- 
pears to be that the notion of class is 
often objectionable and the source of 
both vagueness and crude approxima- 
tion. The alternative is an extensional 
definition, which involves the enumera- 
tion of properties, and no question of 
class definition, although the word 
“properties” also has its drawbacks, as 
has been pointed out by Woodger (16). 

Let us now compare what I have 
called intensional and extensional defi- 
nitions. If class definition is accepted, 
then we set up a class of some kind, 
subject to some class criteria; we categorize
in fact, and in doing this, differences
are overlooked in favor of
similarities. The method is compact
and powerful. Consider, for example,
the enormous power inherent in the 
“Boolean Algebra of Classes.” The 
power of the method depends on our 
ability to categorize without too much 
distortion. Can this always be done? 
The answer must be: not always in 
subjects served by ordinary language 
without vagueness and arbitrariness. 
On the other hand, extensional defini- 
tions do not allow conciseness, conveni- 
ent generalization, or a calculus; they 
do, however, gain in accuracy of state- 
ment. The trouble essentially sur- 
rounds the question of classification. 
As soon as we classify, we overlook 
differences. 

The problem may be solved by a 
compromise. It is literally true that 
behavior theory needs the benefits of 
both systems. Thus extensional defini- 
tions may be thought of as limiting 
cases of intensional definitions, when 
the number of classes tends to infinity. 
The form of definition to be employed 
must therefore allow a degree of ap- 
proximation, suiting the dimensions of 
the subject discussed. We shall pro- 
pose, then, that our definitions should 
start from a glossary definition—a 
short statement of the common proper- 
ties involved in the various uses of the 
term. Then on each occasion that the 
term is used, it will be contextually de- 
fined. The contextual definition may 
be explicit, and certainly it should be 
capable of explicit definition. If we 
now place this common glossary defini- 
tion, and its set of contextual defini- 
tions, in a model of an infinity of levels 
of language, it will be seen that we 
have made explicit a simple, and power- 
ful, device for building language sys- 
tems. The glossary definition, at any 
level, may be taken as a class defini- 
tion, if, for example, for quantification 
purposes, it was necessary to have a 
precise, if arbitrary, statement. This 
may be profitable on some dimensions 
at some times. At the same time there
will exist a general extensional back- 
ground of levels of propositions with 
any degree of accuracy of definition 
possible. A simple example of this 
usage must suffice. At the level of 
molar behavior theory a central term is 
“learning.” A considerable amount of 
discussion takes place without an 
agreed criterion for learning. The re- 
sult is confusion, in many cases, not 
only because two people use different 
criteria, but because they both, them- 
selves, use the term in various ways. 
In order to discuss “learning,” a start 
could be made with a glossary defini- 
tion such as the one from the mono- 
graph of Hilgard and Marquis (8), and 
each time the term was used it would 
be in this general sense, unless some 
explicit contextual definition was ap- 
pended. 
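
The glossary-plus-contextual scheme can be pictured, very roughly, as a small data structure. The following Python sketch is an illustration only and forms no part of the original argument. The class name `Term`, its methods, and the placeholder definition strings are all hypothetical; in particular, the glossary string is not a quotation from Hilgard and Marquis.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Term:
    """A many-meaninged term: one glossary definition plus contextual ones."""
    name: str
    glossary: str  # short statement of the properties common to all usages
    contextual: Dict[str, str] = field(default_factory=dict)  # context -> definition

    def define_in(self, context: str, definition: str) -> None:
        """Append an explicit contextual definition for one usage."""
        self.contextual[context] = definition

    def meaning(self, context: Optional[str] = None) -> str:
        """Return the contextual definition if one was appended, else the glossary sense."""
        if context is not None and context in self.contextual:
            return self.contextual[context]
        return self.glossary


# Placeholder strings, invented purely for illustration.
learning = Term("learning", "placeholder glossary sense of 'learning'")
learning.define_in("maze experiment", "placeholder sense used in the maze experiment")

print(learning.meaning())                   # falls back to the glossary sense
print(learning.meaning("maze experiment"))  # uses the appended contextual sense
```

The fallback in `meaning` mirrors the rule stated above: the term is read in its general glossary sense unless an explicit contextual definition has been appended for the occasion.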

Science has, of course, progressed by 
bringing more and more under less and 
less, in the sense that certain postu- 
lates can be used to subserve more and 
more diverse and complex propositions. 
It is this generalization, simplification, 
and unification that makes science so 
powerful. However, the price paid for
this advance can be too great if it
leads to arbitrariness, and this appears 
to be the present position in behavior 
theory. Certainly it appears as yet to 
be impossible to encompass the great 
variety of behavior in a rigid postula- 
tional system. This, at any rate, is an 
assumption we shall make. Thus ex- 
plicitly an attempt must be made to 
build an extensional system which 
might, as it were, be fixed and made 
exact, at any level, at any time. In 
practice this will only be useful if we 
can suit the fixing to the appropriate 
dimensions. 

Now our criteria for theory construc- 
tion may be summarized. We must 
state our undefined, or ostensively defined
terms, and in giving our ostensive
discussion we must, as far as possible,
explicitly state our philosophical directives
(i.e., those personal prejudices,
etc. that we possess). Next, we must
build a set of postulates which will con- 
tain words such as “learning,” “percep- 
tion,” and so on. Avoidance of these 
particular terms only throws the bur- 
den on some other many-meaninged 
terms. What has been said does not 
mean that we are rejecting the rigid 
postulational methods as used by Hull, 
and his associates (9, 10), or that we 
are returning to straightforward gen- 
eral discursive methods such as have 
been generally used in the work of 
Hebb, Thorndike, Tolman, etc. The 
general discursive method is too vague 
and puts us in a position which leads to 
discussion about terms, and the linguis- 
tic conventions, which are not clear-cut, 
and this state of affairs seems quite in- 
adequate. The alternative, however, is 
not at this stage to go to the strictest 
type of Hull system. This—and this 
is the principal point made in this ar- 
ticle—leads to an arbitrariness which 
is too narrow for the present develop- 
ment of behavior theory. 

The steps that could be taken in 
learning theory would be to reduce the 
number of undefined concepts by de- 
fining in terms of a small nucleus of un- 
defined terms (necessary in any verbal 
system), and by setting up a glossary 
definition for vague, previously unde- 
fined concepts, many-meaninged terms, 
etc. where the “glossary definition” 
should refer to the common properties 
of all usages, and then a particular 
contextual definition given to each 
usage. If a term is used in, say, two 
ways that have no common property, 
then a new term should be introduced. 
If this pattern were followed explicitly,
rather than implicitly, much clarity
would result.

It is necessary to bear in mind the
distinction made between propositions,
which are part of our strict behavior
model, and propositions of the metalan- 
guage, which refer to the propositions 
of the model. This distinction is im- 
plicit in our linguistic theory which is 
assumed to be on any of a possible 
infinity of levels. This sort of distinction
(13) is necessary to cater for antinomies,
and is analogous to the Russell
“Theory of Types.” With respect to
the terminology which we may employ, 
it is, I believe, true that the Carnap 
axiom (4) holds and that each subject 
will develop its own terminology, and 
will gradually discard the terms that 
perform no useful function in the 
subject. 

Thus it is proposed that the con- 
struction of behavior theory involves 
first the rigorization of our language 
systems. We then proceed extension- 
ally, by careful contextual definition of 
terms, to theorems, which are derived 
from postulates which refer both to our 
explicitly stated undefined terms and
our glossary definitions. It is worth
mentioning that some such language
system has recently been developed by
Woodger (16) and previously a somewhat
similar system was developed by
Lesniewski. Woodger aims to avoid
the enumeration of properties and the 
difficulties of class, and he has built an 
extensional type of calculus which allows
discussion to take place and allows
degrees of accuracy to be introduced by
devices known as “time-slicing.” This
is similar to the methods of “indexing” 
suggested by Korzybski (13), and in- 
volves, at the crudest level, the distinc- 
tion between Walter Scott 1843 and 
Walter Scott 1844. This can clearly 
allow of degrees of accuracy, analogous
to the infinitesimal calculus, as 1843 
can be subdivided into March 1843, 
March 3rd 1843, and so on indefinitely 
as we take smaller and smaller “time- 
slices.” This is precisely the sort of 
graded accuracy on various levels of 
discourse, or levels of abstraction, that
is needed in behavior theory, although 
we have used the phrase “enumeration 
of properties,” to avoid the notion of 
definite class. It may now be seen that 
the precision of glossary and contextual 
definitions is essentially related to di- 
mensions, i.e., what is adequate as a 
contextual definition on one level, may
only serve as a glossary definition on 
another level. 
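
The graded accuracy obtained by “time-slicing” can be given a rough illustration in code. The Python sketch below is an assumption-laden picture of the idea, not a rendering of Woodger's calculus or of Korzybski's indexing; the class `TimeSlice` and its method `contains` are hypothetical names introduced here. It supposes only that an individual may be indexed by year, then by month, then by day, each finer slice falling inside the coarser one.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TimeSlice:
    """An individual indexed by a time-slice of adjustable fineness.

    Assumption: a day is only ever given together with its month.
    """
    individual: str
    year: int
    month: Optional[int] = None
    day: Optional[int] = None

    def contains(self, other: "TimeSlice") -> bool:
        """True if `other` is this slice or a finer subdivision of it."""
        if self.individual != other.individual or self.year != other.year:
            return False
        if self.month is None:      # a whole-year slice covers every finer slice
            return True
        if self.month != other.month:
            return False
        return self.day is None or self.day == other.day


scott_1843 = TimeSlice("Walter Scott", 1843)
scott_1844 = TimeSlice("Walter Scott", 1844)
scott_march = TimeSlice("Walter Scott", 1843, 3)
scott_march_3 = TimeSlice("Walter Scott", 1843, 3, 3)

print(scott_1843 == scott_1844)               # distinct indexed individuals
print(scott_1843.contains(scott_march_3))     # finer slice lies within the coarser
print(scott_march_3.contains(scott_march))    # but not the reverse
```

What serves as a sufficiently exact index at the level of years is only a coarse container at the level of days, which is the relation between contextual and glossary definitions drawn in the text.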


APPLICATIONS IN BEHAVIOR THEORY 


Of all branches of behavior theory, 
theories of learning and perception are 
the most philosophically and semantically
sophisticated. The fields of abnormal
behavior, personality, and so
on, are in a much more primitive state
of development and are far from the 
Carnapian state of throwing out the 
functionally useless terms. Thus the 
sort of linguistic systems suggested ap- 
ply even more obviously to the latter 
fields, but I believe they are also enor- 
mously important to the former. The 
whole subject of redefinition of psycho- 
logical terms could well fill many 
volumes. 

The first question is how far may 
terminology be stretched. The best 
example extant is perhaps the terminol- 
ogy of the conditional reflex. Recently 
the Pavlovian terminology of the con- 
ditional stimulus, conditional response, 
unconditional stimulus, unconditional 
response, generalization, etc. has been 
extended from the original “meaning” 
(8, 12) to include more complicated 
behavior activities. This involves the 
extension of definitions of some of the 
terms, and has the advantage of bring- 
ing more and more forms of behavior 
under the same postulational system. 
However, this is only worth while if 
the amount of stretching necessary in 
the definitions is not too considerable; 
otherwise, it would seem more useful to 
introduce a new term, and utilize it until
such time as the integration may
be effected without undue stretch. The 
actual test must be that of overlap re- 
ferred to previously. In any event 
here we may clearly see the use of the 
contextual definition. If we are dis- 
cussing generalization, it is important, 
granted that it has to some degree been 
stretched, to say precisely, in each in- 
stance, what we mean by generaliza- 
tion, and if necessary proceed back to 
the observations to be carried out to 
elicit the phenomenon we are naming. 

The broader problem of whether con- 
ditioning terminology should be used, 
or not, in the wider context of Instru- 
mental conditioning, in a laudable at- 
tempt to bring more of behavior under 
one concise terminology, can only be 
decided by attempting to verify the 
theorems that emanate from the as- 
sumptions, i.e., is it possible to carry 
out the plan without serious distortions 
and omissions? 

Gibson (7) has carried out something
of the desired type of analysis
for the term “set” with the idea of finding
some common meaning for the various
usages. He found none which
might serve for a glossary definition, 
but instead, unearthed ambiguities and 
contradictions. It is precisely this 
messy uncontrolled growth of terminology
that ultimately leads to vagueness
and a lack of clarity in the subject,
which makes prediction impossible. It
is clearly essential that further sur- 
veys of the kind made by Gibson 
should be carried out in order to clear 
up the confusions of the past, and to 
introduce greater rigor in our formula- 
tions in the future. 

It is worth noting, parenthetically, 
that there is another reason why vague- 
ness enters the behavior theories so far 
formulated: the neglect of the genuine 
comparative method in psychology. 
Terms such as “set,” “reinforcement,” 
etc. are being stretched too far if made 
to refer to all organisms, without reference
to contextual differences.

In Hull’s system we find, in spite of 
the relatively precise, perhaps over- 
precise, nature of its formulations, 
vagueness occurring over such words 
(undefined terms of the system) as 
“response.” The truth of many of the 
assertions of Hull’s system would ap- 
pear to depend on the precise interpre- 
tation given to this term in various 
contexts. 

“Learning” itself is notoriously a 
term signifying a process which is 
vague. Note the sort of distinction 
made by Humphrey (11). He regards 
“learning” as a modification of behavior 
“towards a biologically useful end,” 
but not modification of just as precise 
a nature towards an end that is not 
useful, e.g., a seagull learning to follow 
ships to get food, as opposed to a sea- 
gull learning always to fly directly away 
from ships and starving. This is surely 
an odd insistence and implies a funda- 
mental confusion, expressible in the 
statement that learning is not so-and- 
so, but such-and-such (11, p. 105), 
which fails to lay emphasis on the es- 
sential point that we may define the 
word learning in any manner we please 
as long as we have common agreement 
about that usage. It is unnecessary to 
add that many different usages of 
“learning” do in fact exist, although 
their differences are not always, by any 
means, made explicit. 

The last example we can briefly re- 
mark on is the distinction made by 
some psychologists between insight and 
trial-and-error types of learning. If 
learning is not equatable with perform- 
ance, then it is not immediately obvious 
that sudden changes of performance 
are sufficient to build a terminological 
distinction, and in any event, such dis- 
tinction needs to be formulated with 
precision to be useful to science, 
whereas in the past it appears merely to 
refer to different philosophical directives.
If this is a genuine nonverbal
distinction, then it needs to be formu- 
lated in terms that allow some clear 
operational distinction. This appears
typical of the lack of rigor in the existing
state of much psychological theory.


MOLAR AND MOLECULAR


A word on the interpretation given 
to the words “molar” and “molecular” 
seems apposite as these words are indeed
many-meaninged.


Fig. 1. A schematic representation of a pictorial model of behavior.


Granted that we see all of science on
a simple plan of assumptions, consequences
of assumptions, tests of consequences,
etc.—going on indefinitely in
a spiral, where we might equally well
have written postulates, theorems, experiments
to test theorems, postulates,
etc., and granted also that we accept the
need for precise propositions on some
sort of a hierarchical model as has
been suggested, with the added flexibility
allowed by glossary and contextual
definitions, then we should also
expect to find that the molar-molecular
apparent dichotomy really represents
two points on a continuum, at least in
one sense. This distinction is important
because propositions which are
vague, nontestable, etc. on molar levels
may be testable on molecular, and vice
versa. This even applies to different
molar and different molecular levels.
The problem can be simply illustrated
diagrammatically. Figure 1 is a representation
of the essence of behavior
where Ss stand for stimuli, Rs for responses
and O for the organism—to
be regarded essentially, of course, as in
an environment and in a space-time
manifold. The molar theorist now acts 
as shown in Fig. 2. He attempts to 
describe behavior in terms of varied 
and complex stimulus and response
conditions without carrying out tests, 
observations, etc. on the internal state 
of the organism. Thus the vagaries 
that attend making inferences about 
learning from performance are ac- 
counted for by logical constructs (these 
are represented by the “tags” in Fig. 
2, strung together and involving any 
number). A consideration of latent 
learning, delayed reward, etc. from 
molar theory itself is sufficient to show
internal variation cannot always be ne- 
glected, even if this was not obvious 
from our knowledge of physiology, 
endocrinology, etc. Figure 3 illustrates
in essence the molecular viewpoint.
Here the molar constructs are broken
down into molecular constructs where
△ represents diagrammatically the subdivisions
of □. The great advance is
simply that △ can be regarded as part
of □ and part of a molar theory and
testable as such, and at the same time
is testable on the neurological, endocrinological
level. The double purpose
is served of making explicit our molar
constructs and opening up a further
testing ground for their validity.

Fig. 2. A schematic representation of a pictorial model of molar theory.

This whole matter is complex but the
term molecular should be regarded
roughly in the light of tying our molar
constructs to the organic data and thus
hoping to reduce the work to be done
by a single logical construct and thus
to reduce the surplus meaning, and also
the vagueness.

Fig. 3. A schematic representation of a pictorial model of molecular theory.


SUMMARY 


It seems necessary to build precise 
language systems for behavior theory 
which allow for the breadth of the sub- 
ject and which at the same time main- 
tain some degree of rigor. The gaps 
between common languages and nar- 
row postulational techniques need to be 
filled. The method suggested is 
straightforward, although sometimes 
exacting. We must accept a broad
postulational form and be prepared to 
state our undefined terms, postulates, 
logical constructs, etc. and then allow 
for a nucleus of undefined terms that 
are ostensively defined and for “philo- 
sophical directives” which should be 
stated as fully as possible. Then we 
are left with theorems involving many- 
meaninged, multiordinal terms and com- 
plex undefined concepts, which are best 
treated by a glossary definition, sup- 
plemented by a contextual definition. 

Further it will be recognized that we 
are, in language, on a possible infinity 
of levels of abstraction and we must 
differentiate between propositions about 
psychological theory (this paper is on 
this level) and propositions within psy- 
chological theory. 

The attempt has been made to 
enumerate properties, or in Woodger’s 
terms, to deal with the members of the 
class as individuals, rather than to talk 
of classes, as talk of class—more espe- 
cially definite classes—leads to an un- 
welcome arbitrariness and vagueness. 

Lastly, a few of the more obvious 
possible verbal confusions have been 
pointed out in psychological theory as
a starting point for a verbal and se- 
mantic overhaul which seems vital to 
the future of behavior theory. 

It is well recognized that ordinarily,
if not always, any object that is per- 
ceived is perceived either as a certain 
kind of object or as a familiar object 
or both. Almost everyone agrees that
most examples of this characteristic of 
perception imply some kind of trace or 
other prior existing process or organiza- 
tion which has been built up with past 
experience. Terms which have been 
used by psychologists to refer to 
these prior existing somethings are 
“concepts,” “stereotypes,” “schemata,” 
“traces,” “trace systems,” “frames of 
reference,” “enduring cognitive organi- 
zations,” “cognitive maps,” “belief 
value matrices,” “conceptions,” “categories,”
“meanings,” etc.² A review
of the literature employing these terms
reveals two interesting points. First,
there is not nearly as much overlap as 
one might expect. Psychologists using 
the term “concepts” have been inter- 
ested almost exclusively in concept for- 
mation or development. Those using 
the terms “traces” or “trace systems” 
(14, 15, 16) have been concerned with 
the process of communication of the
present stimulus with the trace or trace
system (the Höffding function) or the
communication of one trace with another.
Psychologists using the terms
“stereotypes,” “schemata,” or “frames
of reference” (1, 2, 26, 27) have dealt
with the functional aspects of these
constructs in the perceptions, memories,
thoughts, and behaviors of the individual.
Possibly Krech and Crutchfield
(19) with their discussion of cognitive
organizations have gone furthest in the
integration of these various aspects, but
the research reviewed by Vinacke (33)
and Leeper (21) illustrates how little
integration of this kind has been
achieved among psychologists in general.

1 This paper, which is intended as a foundation
for a forthcoming series of experimental
papers, was written during the spring semester,
1952, while the author was a member of the
Language and Symbolism project at the University
of Michigan, sponsored by the Rockefeller
Foundation. It is in part an outgrowth
of study during a Social Science Research
Council Fellowship, 1948-49, and in part an
extension and revision of the author’s unpublished
Ph.D. dissertation, Princeton University,
1948. Grateful acknowledgment is made to
Dr. Heinz Werner and Dr. Silvan Tomkins,
who encouraged and advised the author during
his SSRC Fellowship.

2 Some psychologists prefer to deal with this
phenomenon in a very different way and do
not use these terms or any close equivalents.

A second point revealed by a review 
of the literature is the paucity of ade- 
quate terms and methods for the identi- 
fication, description, and measurement 
of these constructs. Certainly con- 
cepts, etc. must have many charac- 
teristics and must reveal their existence 
in many different ways, but with relatively
few exceptions, psychologists
write as though one concept is about like
any other concept, one stereotype about
like any other, and as far as investiga- 
tions of what are called “concepts” go, 
methods are confined pretty much to 
sorting or labeling objects, and to for- 
mulating criteria, although in the case 
of “stereotypes” and “schemata” meth- 
ods involving selective perception and 
distortions in memory have been em- 
ployed. 

The lack of adequate descriptive 
terms and methods may contribute to 
the striking discrepancies between the 
exciting hypotheses asserted by those 
who write “in the large” about cogni- 
tive processes or linguistic behaviors 
and the actual results obtained by experimental
research. Many psychologists
have tended to ignore the descriptive
and functional problems in favor
of the genetic problem, i.e., here as in 
some other areas of psychology, psy- 
chologists have been much more inter- 
ested in giving theoretical explanations 
of how something has been learned or 
derived from needs than in describing 
that something itself and demonstrat- 
ing its ways of functioning. The his- 
tory of psychology should give us a 
clue here. The study of perception 
made great strides forward when Katz 
and the gestalt psychologists empha- 
sized the necessity of a description of 
perceptions rather than an “explaining 
away” of perception in terms of asso- 
ciation or an artificial analysis of per- 
ception into hypothetical components, 
as the structuralists had done. Similarly,
the study of behavior has been
greatly aided by careful attention to an
adequate description (usually in terms
of accomplishment, e.g., pressing a bar,
reaching a goal box) instead of an 
analysis into hypothetical components 
(e.g., reflexes in the Watsonian sense). 
Likewise, understanding of language is 
beginning to profit greatly by a care- 
ful description of utterances that actu- 
ally occur (see, for example, the stimu- 
lating book by Fries [8]). It may well 
be that the psychology of cognitive 
processes will not advance much further 
until adequate descriptive methods are 
available. 

The purpose of this paper is to pro- 
pose a number of distinctions which 
can fruitfully be made in the descrip- 
tion of concepts and which enable the 
use of a common terminology in the dis- 
cussion not only of concept formation 
but also of the functioning of concepts 
in personality. These distinctions are 
also intended to imply a close relation 
between traditional psychological prob- 
lems and more or less psychological 
hypotheses which have arisen in related
fields, especially philosophy, sociology 
(the sociology of knowledge), and an- 
thropology and linguistics (especially 
in the writings of Benjamin Whorf [37, 
38]). Some indication will also be 
given of the procedures which are used 
in the actual application of the descrip- 
tive terms. For lack of space the full 
theoretical and applied value of these 
distinctions cannot be argued, but a few 
illustrations are given of how the terms 
apply to some current research. The 
terminology used in the paper is, it 
seems to the author, compatible with 
the usage of “concept” by psychologists 
in the fields of concept formation and 
development. On the other hand, it 
has been necessary to introduce a num- 
ber of distinctions which either have 
not been made explicitly before by psy- 
chologists or, if made, have not been 
labeled. Several of these distinctions 
are very similar to distinctions made in 
logic and semantics, and terms used 
by writers in these fields, especially 
Carnap (3), will be borrowed. The 
letter “c” will be prefixed to borrowed 
terms, to indicate that the usage, 
though similar, nevertheless has important
differences.⁴

Suppose an individual perceives an 
object X as a certain kind of object, 
say A. Then we shall say that this 
individual has a concept C, and we can 
symbolize the perceptual event as 
C(X) = A. We shall call C an “operator”
at the time it is being used to
structure perception. 
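
The paradigm C(X) = A may be made concrete by treating a concept as a function applied to an object. The Python sketch below is offered only as an illustration under stated assumptions: an object X is represented as a dictionary of observable features, a concept C as a function returning a category label A, and the two example concepts (`size_concept`, `use_concept`) and their feature names are hypothetical, invented for this example.

```python
from typing import Callable, Dict

# Assumption for illustration: an object X is a dictionary of observable
# features, and a concept C maps such an object to a category label A.
Concept = Callable[[Dict[str, object]], str]


def size_concept(x: Dict[str, object]) -> str:
    """Hypothetical concept that structures perception by size."""
    return "large" if x["height_cm"] >= 100 else "small"


def use_concept(x: Dict[str, object]) -> str:
    """Hypothetical concept that structures perception by function."""
    return "seat" if x["can_sit_on"] else "non-seat"


def perceive(operator: Concept, x: Dict[str, object]) -> str:
    """Apply the concept currently acting as operator: C(X) = A."""
    return operator(x)


stool = {"height_cm": 45, "can_sit_on": True}

print(perceive(size_concept, stool))  # the same object perceived as one kind
print(perceive(use_concept, stool))   # and, under another operator, as another
```

The same object yields different results under different operators, and in neither case is the result identical with the object itself; the sketch says nothing about which operator will in fact be used on a given occasion.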

3 At least to the author these distinctions
are similar. I do not agree with those who
maintain that discussions of meaning in philosophy
and in psychology have, or should
have, nothing in common.

4 The differences between the present usage
and Carnap’s usage arise from the fact that
the terms in the present paper take into account
the individual.

We shall also use the term “operator”
for a concept that is being used in
an analogous way in cognitive functions
other than perception, in the strict 
sense of the term, such as what is 
sometimes called “social perception” 
(e.g., perceiving that Mr. F. likes Miss 
G., perceiving that Mrs. H. is insincere, 
etc.), and also in functions like remem- 
bering, thinking, imagining, conceiving. 
In the following discussion, the words 
“perception” or “perceiving” will be 
used for brevity, although the asser- 
tions are intended to cover other cogni- 
tive functions as well. The discussion 
assumes, therefore, as a working as- 
sumption, that the same internal or- 
ganizations that operate in one cogni- 
tive function operate in other functions. 
This assumption is not unverifiable; it 
is already supported to some extent by 
available evidence, but if further re- 
search should demonstrate that it does 
not hold, there will be very little value 
in the notion of concept. 

It should be emphasized that “con- 
cept” in the present discussion is not 
being used as a pure intervening vari- 
able (23). It is assumed that there are 
some kinds of internal organizations 
or processes which are responsible for 
categorical aspects of cognitive func- 
tions, and that these internal organiza- 
tions have properties which we must 
infer from observation. Part of what 
follows will seem curiously empty or 
tautological if this is not kept in mind. 

To refer again to our paradigm 
C(X) = A, the significance of the fact
that A is not X, that in perceiving the 
world we use only a small number of 
the operators that might be used, and 
that different people, or even the same 
person at different times, may use dif- 
ferent operators, has been discussed by 
so many psychologists, philosophers, 
sociologists, anthropologists, and linguists
that no attempt will be made to
discuss its significance here.⁵ It may
be pointed out, however, that the question
of whether an individual has a
concept (e.g., whether he can, under
suitable instructions, make certain sortings
or classifications and/or give a
satisfactory definition) is probably not
as important as the question of under
what conditions the concept will
operate.

5 Among those who have remarked about the
significance of this or related phenomena are
Helmholtz, Wundt, James, Titchener, Piaget,
Bartlett, Schachtel, Koffka, MacLeod, Sherif
and Cantril, Allport and Postman, Krech and
Crutchfield, Plato, Aristotle, Aquinas, Locke,
Berkeley, Cassirer, De Laguna, Whitehead,
Sapir, Whorf, Lady Welby, Lippmann, and
Korzybski.

A considerable amount of psychological
research can be grouped under the
following two problems:

1. What is the functional significance
of individual differences in operators
under certain standard conditions?
Much of the research in projective techniques
deals with this problem.

2. What are the factors (other than
personality) making for the operation
of one concept rather than another in
the same individual? Under this heading
would come much of the research
on the effect of need, set, structural
factors, and practice on perception.
For example, Duncker’s experiments on
“functional fixedness of solution objects”
(4) can be thought of as dealing
with this problem, as he demonstrated
that under certain conditions
once an object has been operated on by
a certain concept, it is more difficult
for a different concept to operate on
that particular object. Under other
conditions, however, the operation of
one concept greatly facilitates the operation
of certain other concepts on the
same object, as for example in racial or
ethnic prejudice, where the perception
of an individual as a member of a certain
group reduces the number or
strength of cues or information necessary
to trigger the operation of concepts
associated with that group. Probably
almost any operator increases the
probability of operation of some con- 
cepts and decreases the probability of 
operation of others. 

In our paradigm, C(X)= A, we call 
A a category of the concept C, and X 
an argument. We say then that a con- 
cept operates upon an argument to 
yield a category.⁶ The concept C may
have categories in addition to A. By 
the statement “B and A are both cate- 
gories of C” we mean that whatever 
concept operates on an object X to 
yield A is the same concept as that op- 
erating on some object Y to yield B. 
There are methods for determining 
whether A and B are categories of the 
same C, but it will be easier to describe 
these once we have introduced a few 
additional terms. 

We distinguish between the category 
A and the verbal label which the indi- 
vidual uses for the category or for a 
member of the category. We call such 
a label an “expressor” of the category 
A. A category may have more than 
one expressor. The expressors of a con- 
cept are the expressors of its categories. 

A complete description of a concept 
includes two aspects, content and for- 
mal properties. We shall discuss each 
of these aspects in detail. 

The most obvious way of describing 
concepts is in terms of the assign- 
ment of objects to the concept’s various 
categories. In concept-formation experi- 
ments this is the most commonly em- 
ployed test for determining whether a 
given concept has been formed; im- 
plicitly, therefore, this is the way the 
concept is described. To explicate this
notion precisely we shall borrow some 

6 The present author likes to call these categories
“values,” as this usage is in accord with
the logical analogue (an operator operates
upon an argument to yield a value) and is
more appropriate than “category” when the
notion of a nominal operator, as discussed in
this paper, is extended to that of an ordinal,
interval, or ratio operator.
terms from logic and semantics. If
C(X) = A, then we call X a “c-denotatum”
of A (cf. Morris [24], Russell,
J. S. Mill, etc.) and also of any of A’s 
expressors. It is convenient to speak 
of the c-extension with respect to a par- 
ticular set of objects S. If an individ- 
ual selects from a set S all those which 
he is willing to classify as A’s, then the 
latter subset is the c-extension of A 
with respect to S. The c-extension of 
a concept, on the other hand, is the 
set of c-extensions of the categories of 
the operator; it is what the mathemati- 
cian calls a partition of a set of objects. 
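
The notions of c-extension of a category and of a concept can be illustrated with a short computational sketch. The Python fragment below is offered only as an illustration and is not part of the formal apparatus. It assumes that a concept can be modeled as a function from arguments to categories, in accord with the paradigm C(X) = A; the names `c_extension_of_category`, `c_extension_of_concept`, and `size_concept`, and the set of blocks, are hypothetical.

```python
def c_extension_of_category(objects, concept, category):
    """Subset of `objects` that the individual classifies under `category`."""
    return frozenset(x for x in objects if concept(x) == category)


def c_extension_of_concept(objects, concept):
    """Set of c-extensions of the concept's categories: a partition of `objects`."""
    by_category = {}
    for x in objects:
        by_category.setdefault(concept(x), set()).add(x)
    return {category: frozenset(members) for category, members in by_category.items()}


# Hypothetical subject who sorts blocks, each identified by its edge length in cm.
def size_concept(edge_cm):
    return "small" if edge_cm < 5 else "large"


S = {1, 2, 4, 6, 9}
partition = c_extension_of_concept(S, size_concept)
print(partition["small"])  # the c-extension of "small" with respect to S
```

The dictionary returned holds one c-extension per category; its values are disjoint and together exhaust S, which is what makes the result a partition.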

It should be noted that we have not
said that any object that has certain
properties is necessarily a c-denotatum
of a certain expressor; it is at this point
that we diverge from semantics into
pragmatics. An object X is, in our
usage, a c-denotatum of an expressor if
and only if the individual whose concept
we are studying applies that expressor,
whether X actually has certain
properties or not.

A description of a concept purely in 
terms of its extensions with respect to 
certain sets of objects leaves something 
out, namely what, in a certain sense, 
the individual “means” by his expressors.⁷
We shall call the properties
which the individual perceives⁸ (or believes
or conceives) the c-denotata as
having the “c-intension” of the cate- 
gory (and its expressor). In some 
cases we can distinguish between the 
defining c-intension and the incidental 
c-intension, i.e., between those properties
which define the category and those
which tend to be attributed (perhaps 
with varying strengths of hypotheses) 

7 The inadequacy of a description in terms 
of extensions is especially obvious in those 
cases in which the extension is null (e.g.,
“unicorn”) or has only one member (e.g., 
“myself”). 

8 The word “perceives” is used in the
broadest sense, to include effects of cues of
which the individual may not be conscious.
to any c-denotatum but are not defining
properties.⁹ For example, part of the
defining c-intension of the expressor 
“Negro” by a prejudiced individual 
would probably be the property of hav- 
ing dark skin and part of the incidental 
c-intension might be the property of 
being innately stupid. In most cases,
however, a distinction between defining
and incidental c-intensions will be very
difficult, if not impossible, to make.
Even in scientific research these two 
parts of a c-intension are not always 
distinguished. If one reviews the history
of the duplicity theory of vision,
for example, one looks in vain for care- 
ful definitions of “rods” and “cones”; 
as a consequence of this ambiguity, 
many of the statements made in the 
development of the theory (including 


9 The term “defining c-intension of the expressor,”
as here defined, has a meaning
somewhat similar to that of Frege’s “Sinn” 
(5), Carnap’s “intension” (3), Morris’s “sig- 
nificatum” (24), Lewis’s “signification” (22), 
J. S. Mill’s “connotation,” etc., except that 
these terms are usually used by the authors 
with reference to languages or semantical sys- 
tems instead of with reference to one indi- 
vidual’s usage, or else they are used in a 
more restricted sense technically (e.g., cf. Mor- 
ris). Carnap’s “intension” includes the defin- 
ing c-intension plus all its logical implications, 
which, in the language of the present paper, 
may or may not be part of the incidental 
c-intension. For example, the intension, in 
Carnap’s sense, of the expressor “positive 
integer” would include the property of hav- 
ing a unique factorization into prime factors, 
as it has been proved that all positive integers 
do have unique factorizations, whereas this 
property would not for any individual’s ex- 
pressor be part of the defining c-intension, 
and would be a part of the incidental c-inten- 
sion if and only if the individual using the 
expressor “positive integer” knew the unique 
factorization theorem or for some other rea- 
son believed that positive integers have this 
property. 

The term “intensional” is used by Korzybski 
and Hayakawa in a way quite different from 
the present usage of “defining c-intension,” 
though their usage does have some similarity 
to that of “incidental c-intension.” 
the general statement of the theory it-
self) are confusing mixtures of defini- 
tions, tautologies, and empirical as- 
sertions, and the extent to which these 
statements are verified by certain ob- 
servations is left indeterminate. It is 
well recognized that in some cases in 
which this distinction can be made we 
categorize an object which clearly satis- 
fies a defining c-intension, then assume 
without conscious inference that the in- 
cidental c-intension is also satisfied. 
Work on prejudice and stereotypes has 
emphasized this mechanism, discussed 
by psychologists under the term “as- 
similation” (Allport and Postman [1]),
“frame of reference” (Sherif and Can- 
tril [27]), or “incorporation into or- 
ganized cognitive structures” (Krech 
and Crutchfield [19]). 

We have defined “c-intension of a 
category.” The defining and incidental
c-intensions of a concept are the sets of 
defining and incidental c-intensions of 
its categories. 

Notions like that of c-intension are 
strongly rejected by many of those in- 
terested in problems of language and 
meaning. The argument runs thus: 
There are utterances (verbal behaviors), 
other behaviors, and objects; there may 
even be images, feelings, and other im- 
plicit responses, but look wherever you 
will, there are certainly no intensions of 
any kind, and to drag in some notion of 
properties being attributed to objects 
is only to introduce confusion, pseudo- 
problems, and a lot of fruitless verbiage. 
This argument does not convince the 
present author; to him the elimination 
of notions like c-intensions, though per- 
haps theoretically possible, greatly han- 
dicaps any attempt to deal with mean- 
ing and leads to rather cumbersome and 
peculiar circumlocutions.¹⁰

10 One should not assume too hastily that
because certain abstract entities do not exist
in a spatio-temporal sense they are thereby
eliminable in a theoretical analysis. The
reader interested in logic and mathematics is
referred to Alonzo Church’s paper, “The need
for abstract entities in semantic analysis,”
Proceedings of the American Academy of Arts
and Sciences, Vol. 80, No. 1, July, 1951, for
discussion of a related point. For some of
the circumlocutions which are necessary to
avoid the use of certain abstractions in mathematics,
see the paper by Goodman and Quine,
“Steps toward a constructive nominalism,” J.
symbolic Logic, 1947, 12, 105-122.

JOE ADAMS

There are two methods of investigating
c-intensions. Perhaps the most obvious
is to ask the subject what he
means by a given expressor, thereby obtaining
a formulated c-intension (a
definition is a certain kind of formulated
c-intension, namely, one which
formulates a set of properties both necessary
and sufficient for class membership;
cf. Morris’s “formulated significatum”
[24]). However, many laboratory
investigations as well as everyday
observations have shown that formulated
c-intensions may be very inadequate.
Several investigators (9, 10, 11,
25, 28, 29) have shown this to be the
case even in situations where the defining
c-intensions can easily be formulated
in the language available, which is
not the case with certain everyday expressors,
such as “dog.” Exactly what
is meant by “adequacy of a formulated
c-intension” will be stated below. At
any rate, the obvious inadequacy of
formulated c-intensions necessitates the
use of the second method, which is inference
from properties which c-denotata
have in common, at least those
properties which were easily perceivable
by the subject during the conditions
under which the objects were made
c-denotata. It should be emphasized
that not all properties held in common
by the c-denotata of a category are included
in the c-intension of that category;
only those which are perceived,
believed, or conceived are so included.
Further, some properties which the
c-denotata do not actually have may be
part of the c-intension. When an individual
learns that the c-denotata have
a property he did not previously know
about, that property may become part
of the incidental c-intension, though
with time it may be absorbed into the
defining c-intension.

With the preceding terms we can differentiate
several kinds of content similarity
between two categories. First,
c-extensional equivalence¹¹ with respect
to a set of objects S, which is simply
the degree of extensional overlap
(intersection of the two extensions),
definable quantitatively. Second,
c-intensional equivalence, or degree of
similarity in c-intensions. In some
cases this can be split up into defining
c-intensional equivalence¹² and incidental
c-intensional equivalence. These
terms can also be used with reference
to concepts. The c-extensional equivalence
of two concepts with respect to a
set S is the overlap¹³ in the two respective
partitions (sets of c-extensions).
The c-intensional equivalence of two
concepts is the similarity in the sets of
c-intensions of the respective sets of
categories.
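
The text leaves the measure of overlap open. As one possible quantification, and purely as an assumption of this sketch, the overlap of two categories may be taken as the ratio of the intersection to the union of their c-extensions (the Jaccard index), and the overlap of two partitions as the proportion of pairs of objects which both partitions treat alike (the Rand index). Neither measure is prescribed by the present discussion, and the function names are hypothetical.

```python
from itertools import combinations


def category_overlap(extension_a, extension_b):
    """Intersection over union of two c-extensions (Jaccard index)."""
    a, b = set(extension_a), set(extension_b)
    union = a | b
    if not union:
        return 1.0  # convention chosen here: two null extensions coincide
    return len(a & b) / len(union)


def partition_overlap(labels_1, labels_2):
    """Proportion of object pairs grouped alike by both concepts (Rand index).

    Each argument maps every object of the same set S to a category.
    """
    pairs = list(combinations(list(labels_1), 2))
    if not pairs:
        return 1.0
    agreements = sum(
        (labels_1[x] == labels_1[y]) == (labels_2[x] == labels_2[y])
        for x, y in pairs
    )
    return agreements / len(pairs)


print(category_overlap({1, 2, 3}, {2, 3, 4}))  # prints: 0.5
```

Both measures run from 0 (no agreement) to 1 (perfect equivalence), so that criteria stated in terms of "perfect" or "near zero" equivalence have a definite numerical reading under these assumptions.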

It was stated earlier that a category 
may have more than one expressor. 
Given two expressors, “A” and “B,” 
how do we decide whether they express 
the same category? There are several 
criteria to use: 

1. Perfect c-extensional equivalence 
of “A” and “B” with respect to any set 
of objects. 

2. Perfect c-intensional equivalence, 
in so far as it can be inferred from c- 


11 Cf. Carnap’s “equivalence” (3), Johnson’s
“extensional agreement index” (12).

12 Cf. Carnap’s “L-equivalence,” which is an
either-or kind of predicate not involving similarity.

13 The overlap of two partitions of a set
can be precisely defined in several ways. For
the purposes of this paper a definition is not
necessary.
extensional equivalence and from for-
mulated c-intensions. 

3. A decrease in time required to de- 
cide whether to apply expressor “B” if 
the decision whether to apply expressor 
“A” has already been made, i.e., a de- 
crease, compared with the time required 
to make the decision for similar objects 
not previously decided upon with re- 
spect to “A.” 

4. Indication from the subject, verbal 
or otherwise, that for objects for which 
the decision whether to apply “A” has 
already been made, the decision whether 
to apply “B” has also been made and 
explicitly indicated. In other words,
that “B” “has the same meaning
as” “A.”

If all these criteria are met, good 
evidence has been obtained that “A” 
and “B” express the same category. In 
some cases criteria 1, 2, and 4 are suf- 
ficient, as with some expressors and 
some sets of objects decision time is so 
short anyway that no decrease can be 
observed. 

Even if “A” and “B” are not ex- 
pressors of the same category, they may 
be expressors of the same concept, i.e., 
“A” and “B” may be different yet both 
be categories of “C.” It was asserted 
earlier that there are methods for deter- 
mining this. We can now list the cri- 
teria for deciding whether “A” and “B” 
are nonoverlapping categories of “C”: 

1. Zero (or near zero) c-extensional 
equivalence. 

2. A decrease in time required to de- 
cide whether X is a c-denotatum of 
“B” when it has already been decided 
whether X is a c-denotatum of “A.” 

3. An indication, verbal or otherwise, 
from the subject that if X has already 
been made a c-denotatum of “A” then 
the decision that X is not a c-denotatum
of “B” has already been made
and explicitly indicated. 

4. An indication from the subject 
that he feels or believes that the judg- 
ment of whether X is a c-denotatum of
“A” is the same kind of judgment as 
that of whether X is a c-denotatum of 
“B.” (Don’t depend on the subject, 
however, to explicate what he means 
by “same kind”!) 

The fulfillment of these criteria con- 
stitutes evidence that “A” and “B” are 
expressors of nonoverlapping categories 
of “C.” The problem of overlapping 
categories or of hierarchical structure is 
more complicated and no attempt will 
be made to list criteria here. 

Most psychologists would probably 
agree that almost any word and the 
same word with “not,” or the prefix 
“un” or “in” preceding it are expres- 
sors of the same concept. By using the 
above criteria we can verify this more 
or less obvious intuition, at least with 
human subjects. Certainly any ex- 
pressors of the form “E” and “Not E” 
will have zero, or near zero, c-exten- 
sional equivalence. This is one way of 
stating the law of contradiction. It will 
in general take a subject less time to 
decide, for certain kinds of objects at 
least, whether to call an object “E” if 
he has already decided whether to call 
it “Not E.” Further, if the expressor 
“E” has already been applied to an ob- 
ject, it will seem to the subject that the 
same kind of judgment is called for, 
that he has already decided not to ap- 
ply the expressor “Not E,” and that he 
has already explicitly indicated that 
he would not apply “Not E.” Words 
called “opposites” also tend to satisfy 
these criteria. 

Notice that there is no mention above 
of the following condition: if X is not a 
c-denotatum of “A” then X will be a 
c-denotatum of “B.” This condition 
will be fulfilled in certain cases, but 
certainly not in general. Even expres- 
sors of the form “E” and “Not E” 
will not always fulfill this condition, 
e.g., merely because an individual will 
not apply the expressor “liberal” to a 
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given person does not mean that he 
will apply the expressor “not liberal,” 
and this may hold even if he considers 
the liberality of the person and tries 
to make a decision. In other words, in 
pragmatics a law of excluded middle 
does not hold. 
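
The difference between the two laws can be made concrete with a small sketch. The Python fragment below is an illustration only: it assumes that a subject's judgments are recorded as two sets, the objects to which he applies “E” and the objects to which he applies “Not E,” and it checks each law against those records. The function name `check_laws` and the judgments about five persons are hypothetical.

```python
def check_laws(objects, applied_e, applied_not_e):
    """Compare a subject's use of "E" and "Not E" over a set of objects.

    `applied_e` and `applied_not_e` are the c-extensions of the two expressors.
    """
    objects = set(objects)
    both = objects & set(applied_e) & set(applied_not_e)
    neither = objects - set(applied_e) - set(applied_not_e)
    return {
        "contradiction_law_holds": not both,   # no object receives both labels
        "excluded_middle_holds": not neither,  # every object receives one label
        "undecided": neither,
    }


# Hypothetical judgments with the expressors "liberal" and "not liberal";
# persons 4 and 5 receive neither label.
persons = {1, 2, 3, 4, 5}
result = check_laws(persons, applied_e={1, 2}, applied_not_e={3})
print(result["contradiction_law_holds"], result["excluded_middle_holds"])
# prints: True False
```

The objects returned under `"undecided"` are those for which neither expressor was applied; they are the cases that prevent an inference from “not a c-denotatum of A” to “a c-denotatum of B.”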

The foregoing discussion of content 
description makes it easier to discuss 
the formal properties of concepts. The 
following is a list of some of these 
properties: 

1. Structure. By this is meant sim- 
ply the number of categories and the 
relation between them. Possibly there 
exist concepts with only one category, 
e.g., concepts expressed by “entity,” a
word which is applied by some writers 
to any object, relation, etc., abstract or 
concrete. Most concepts, however, 
have at least two categories (though it 
may be that for many concepts only 
one category has any denotata), which 
is the simplest case of nominal or quali- 
tative structure. Some nominal structures
may be rather complex, as when
one category includes two or more other
categories. A fairly common hypothesis
among psychologists is that, as the
individual develops, his concepts have 
a more and more complex structure (cf. 
Welch and Welch and Long on hier- 
archical structure [34, 35]). 

Some other types of structure are 
ordinal, interval, and ratio. It may 
seem surprising that these terms, de- 
veloped in the theory of measurement 
(30), should turn up in this context. 
Actually, however, the psychology of 
concepts and the theory of measure- 
ment are intimately related, in fact, a 
really adequate theory of measurement 
requires an adequate psychology of 
concepts, whether one calls it that or 
not, a fact overlooked by some writers 
in the former field. In the case of 
interval and ratio operators, and some- 
times with ordinal or nominal operators, 
the operation of looking or otherwise 
directing the sense organs and making
a judgment is supplemented by certain 
prescribed manipulations of a more or 
less complex kind, but in all cases the 
end result is a category or value, and 
the problem of adequate description in 
terms of c-extensions, c-intensions, for- 
mulations, equivalences, etc., is just as 
much present in the case of interval 
and ratio operators (the problem of 
what the dimension means) as in the 
case of nominal operators.¹⁴

It seems apparent that there are im- 
portant individual differences in opera- 
tor structure. Frenkel-Brunswik (6, 7) 
has shown that some children think in 
dichotomies (weakness-strength, mascu- 
line-feminine, right-wrong) more than 
others, and she maintains that such 
dichotomous operators are generally 
characteristic of the ethnically preju- 
diced and the authoritarian personali- 
ties. Klein (13) maintains that a fun- 
damental aspect of personality is re- 
vealed in individual differences in op- 
erator structure. Korzybski attributed 
many ills of the world to dichotomous 
operators (“two-valued, either-or orien- 
tations”), and some psychologists more 
or less agree with this kind of assertion. 

2. Decidability. We call a concept 
decidable with respect to an argument 
X if X can be categorized, in a way 
satisfactory to the individual, by that 
concept acting as an operator, otherwise 
undecidable. We call X c-decidable or 
c-undecidable. The word “vague,” 
when applied to concepts, sometimes 
means c-undecidability, though other 
meanings are low c-extensional equiva- 
lences and inadequacy of formulated 
intensions (see below). The inability 
to tolerate c-undecidability is probably 
related to intolerance of ambiguity as 
discussed by Frenkel-Brunswik (6, 7); 


14 Stevens (30) mentions only the most un- 
interesting case of nominal structure, that in 
which labels (numerals) are used for identifi- 
cation marks, without meaning (intension). 
in her discussion, however, she stresses
dichotomous (or other relatively undif- 
ferentiated) operators rather than ex- 
treme c-undecidability. 

3. Availability. This is inferred from 
the conditions under which a concept 
will or will not operate. A measure of 
availability can be the amount of in- 
formation necessary for operation, the 
speed of operation, etc. Availability 
is obviously related to the Höffding
function. 

In an experiment using incomplete 
figures similar to those of the Street 
Gestalt Completion Test (31), Verville 
and Cameron (32) found age and sex 
differences in the speed of operation of 
concepts (i.e., time between presenta- 
tion of a stimulus and the perception 
of it as a certain kind of thing). 
Whether this result would hold with 
other stimulus materials and other con- 
ditions of operation has not, to the 
present writer’s knowledge, been inves- 
tigated, nor has the extent or signifi- 
cance of individual differences in this 
property been demonstrated. The speed 
of operation of a concept is analogous 
to the latency of a response, but this 
property has not been fully exploited 
either as a measure of how well a con- 
cept has been formed, analogously to 
the use of latency as a measure of habit 
strength, or as a diagnostic measure in 
personality investigations. 

4. Adequacy of formulation of the 
c-intension. If an individual formu- 
lates in the language L the c-intension 
of his concept C, then the interpreter 
I forms a concept by interpreting the 
formulated intension. We can dis- 
tinguish three kinds of adequacy: 

a. C-extensional adequacy with respect
to a given set S. This is the c-
extensional equivalence of the two con- 
cepts, C and the interpretation (in L) 
of the formulated c-intension. Smoke 
(28) found that some of his subjects’ 
formulated intensions were not com- 
pletely c-extensionally adequate, even
with respect to the set of objects used 
as arguments in the formation of the 
concepts. 

b. Defining c-intensional adequacy. 
This is the defining c-intensional equiv- 
alence of the two concepts. 

c. Incidental c-intensional adequacy. 
This is the incidental c-intensional 
equivalence of the two concepts. 

In scientific arguments over defini- 
tions considerable confusion results 
from failure to specify the kind and 
degree of adequacy expected or re- 
quired. This point could with justifica- 
tion be considerably elaborated. 

5. Degree to which the c-intension
is conscious. To some psychologists
this property will seem either unintelligible
or reducible to property 4. The
writer does not share this view; any
close tie-up of the psychology of concepts
with either psychoanalytic theory
or with phenomenology presupposes an
investigation of this property.


6. Validity (with respect to a given 
set of objects). The extent to which 
the c-denotata of the categories actually 
satisfy the respective c-intensions. If 
one categorizes, as in the Ames demonstration
(20), a distorted room as a
rectangular room, for this particular 
object in this situation his operator has 
very low validity. The validity of op- 
erators is one (but not the only) aspect 
of contact with reality. 

7. Domination. The extent to which 
various behaviors and experiences of
the individual are functions in a given 
situation of the result of the operation 
of the concept. We call such a concept 
a dominator in that situation. Three 
kinds of domination can be distin- 
guished: behavior, affect, and percep- 
tion. Refusing to shake hands with a 
Negro, feeling negative toward him, and 
seeing something wrong in everything 
he does are often examples of behavior, 
affect, and perception dominators, respectively.¹⁵

We can give a partial description of 
a paranoid condition by saying that 
the individual is behavior, affect, and 
perception dominated by highly invalid 
operators. 
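
Validity, as defined under property 6 above, can be given a rough numerical form. The sketch below assumes, as a simplification not made in the text, that a c-intension can be represented by a predicate on the actual properties of an object and that validity is the proportion of c-denotata satisfying that predicate. The function name `validity` and the three rooms are hypothetical.

```python
def validity(denotata, intension):
    """Proportion of c-denotata that actually satisfy the c-intension."""
    denotata = list(denotata)
    if not denotata:
        return None  # left undefined here when the extension is null
    return sum(1 for obj in denotata if intension(obj)) / len(denotata)


# Hypothetical rooms, each described by its actual shape; the observer has
# categorized all three as "rectangular room."
rooms_called_rectangular = [
    {"name": "office", "shape": "rectangular"},
    {"name": "hallway", "shape": "rectangular"},
    {"name": "distorted room", "shape": "trapezoidal"},
]


def is_rectangular(room):
    return room["shape"] == "rectangular"


print(validity(rooms_called_rectangular, is_rectangular))
```

Restricting the set of objects to the distorted room alone gives a validity of zero, which corresponds to the statement that for this particular object the operator has very low validity.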

In addition to the many problems in 
the field of concept formation which 
the foregoing distinctions suggest, we 
can state, using these distinctions, in 
fairly precise form a rather comprehen- 
sive set of theses about cognitive func- 
tioning as follows: Concepts which are 
describable in terms of formal proper- 
ties and content¹⁶ play a demonstrable


15 It is well recognized that expressors of
dominators are sometimes re-defined in an attempt
to change the extensions and/or intensions
and yet keep the domination. C. L.
Stevenson, in his book Ethics and Language,
calls such a definition a “persuasive definition.”

16 There is a school of thought which holds 
that concepts, if they are to be talked about 
at all, must be defined and described in terms 
of behaviors rather than in terms of formal 
properties and content as described in this 
paper. It is obvious that behavioral indices 
must be used (such as sorting, discriminatory 
responses, labeling, autonomic responses, etc.) 
in the determination of c-extensions and c- 
intensions; however, the present author be- 
lieves that any attempt to define and describe 
concepts in terms of behaviors will turn out 
to be about as hopeless and fruitless as was 
the older attempt to describe concepts in terms 
of images. Behavior is the result of so many 
variables that it is difficult to see why anyone 
regards behavioral definitions of any one vari- 
able (of any appreciable complexity) as other 
than a hopelessly complicated task. It should 
be noted that descriptions existing in the pres- 
ent literature do describe concepts in terms of 
c-extensions and c-intensions, though of course 
they use behavioral indices in obtaining these 
descriptions. For a statement of a point of 
view held by some structural linguists and 
some psychologists and rather divergent from 
that presented in this paper, see Bloomfield, 
L., “Language or ideas?” Language, 1936, 12, 
89-95. 

A more serious objection to the present dis- 
cussion runs as follows: the author seems to 
endorse the ancient view that concepts are 
entities which are carried around by the indi- 
role in the formation and organization
of experience and behavior; the formal 
properties and the content of frequent 
operators are related to the personality 
structure of the individual; the content 
and formal properties of the operators 
frequently and characteristically used 
by members of a given culture are 
strongly related to the lexicon and 
grammar of the language used in the 
culture; potential dominators, i.e., con- 
cepts which if they operated would 
dominate, tend to be operators. If any 
of these theses is true, the psychology 
of concepts overlaps with many other 
areas of psychology and has important 
implications for many other disciplines 
and applied fields. 
The present paper is written to show
that the Hullian theory of behavior is 
capable of including an expectancy con- 
cept which may be likened to that of 
Tolman (5). It is proposed to derive 
this concept of expectancy as a theorem 
to Hull’s 1943 postulate set (3). Then, 
in the language that Hull has preferred, 
the notion of expectancy will be a corol- 
lary. It will be necessary to explain 
first what is meant by the word ex- 
pectancy in the present paper, and how 
the notion will be conceptualized. Then 
the assumptions which are necessary 
will be set forth, followed by the de- 
duction. 

The starting point for the present 
notion will be the sign-gestalt-expecta- 
tion of Tolman (5). We quote Tol- 
man: “In the case of a sign-gestalt- 
expectation the organism is ready either 
as a result of perception (q.v.), mne- 
monization (q.v.), or inference (q.v.) 
to have some sort of positive or nega- 
tive commerce with an immediately 
presented object (i.e., the sign-object) 
as the means-end-route (i.e., the signi- 
fied means-end-relation) to get to or 
from such and such a further object 
(i.e., the signified-object)” (5, p. 453). 
It is thus seen that the sign-gestalt- 
expectancy is a relation between signs 
such that the first occurring sign elicits 
an approach to or an avoidance of the 
later occurring sign. Special emphasis 
is placed on the wholeness or the unity 
of this trinity, and the fact that it di- 
rects behavior. 

If the above quotation were to be 
translated into the language of Hullian 
theory (it is hoped, without doing vio- 
lence either to Hullian concepts, or to 
Tolman’s terminology), it would read 


like the following: The goal object 
evokes a goal response. On successive 
trials stimuli in the immediate environ- 
ment come to signal the occurrence of 
the goal object. Gradually, as the 
number of trials increases, stimuli 
which are more distant from the goal 
object acquire this property of signaling 
the occurrence of the goal object. 
Eventually the entire experimental ap- 
paratus will possess directive value for 
the organism. The directive properties 
which these stimuli acquire result from 
their relation with the goal object (in- 
centive stimulus). It is this relation- 
ship between the incentive stimulus and 
other stimuli in the experimental situa- 
tion that constitutes the expectancy. 

The expectancy will be conceived as 
a power of a relation between relations 
which hold between stimuli and re- 
sponses, i.e., the expectancy is seen as a 
relationship which holds between two 
relations which take stimuli and re- 
sponses as arguments.¹ First, the rela-
tion evokes, which holds between a 
stimulus complex and a response com- 
plex, will be considered to be irreflexive 
and asymmetrical. Second, the rela- 
tion leads to, which holds between re- 
sponse complexes and stimulus complexes,
will be considered to be irreflexive
and asymmetrical also. The expectancy
will then be understood as a
power (po) of the relation relative
product holding between the relation 
evokes and the relation leads to. 

The notion of the relative product is 
introduced by Whitehead and Russell 
in their Principia Mathematica in Chap- 
ter 34 of Volume I (6). The relative 


1 The arguments of a relation are the terms 
that are related by that relation. 
product is a relation between two relations
when there is some term to which
the members of the domain of the first 
relation are related by the first relation, 
and to which the members of the con- 
verse domain of the second relation are 
related by the second relation. That is 
to say, if we let “R” and “S” denote 
relations, and “(R/S)” denote their 
relative product, then the relative prod- 
uct may be defined as 


$(R/S) =_{\mathrm{Df}} \hat{x}\hat{z}\{(\exists y) \,.\, xRy \,.\, ySz\}$


The power of a relation is the rela- 
tion between two terms when one is a 
finite number of relation steps removed 
from the other. We quote Woodger
who says of the relation: “$R_{po}$ is the
relation of being related by some power
of R. Thus we have $x R_{po} y$ when y
can be reached from x in some finite
number of R-steps, i.e., when

$xRv \,.\, vRw \,.\, \ldots \,.\, zRy$
or $x(R \cup R^2 \cup R^3 \ldots)y$”

(7, p. 39). $R_{po}$ is defined as follows:

$R_{po} = \hat{x}\hat{y}\{(u) : R\text{“}u \subset u \,.\, R\text{‘}x \subset u \,.\supset.\, y \in u\}$

“$R_{po}$ will be the relation of x to y if it
is the case that, whatever class u may
be, if R“u are included in u and if the
R‘x are included in u, then y ε u.”
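For finite relations the two definitions above can be made concrete. The following is a minimal sketch and is not part of Woodger's or Whitehead and Russell's text: relations are represented as Python sets of ordered pairs, and the function names `relative_product` and `power_closure`, like the three-step chain used as data, are illustrative assumptions.

```python
def relative_product(R, S):
    """Return R|S: every pair (x, z) such that x R y and y S z for some y."""
    return {(x, z) for (x, y1) in R for (y2, z) in S if y1 == y2}


def power_closure(R):
    """Return R_po = R U R^2 U R^3 U ... for a finite relation R."""
    closure = set(R)
    frontier = set(R)
    while frontier:
        # Extend every newly found chain by one further R-step.
        frontier = relative_product(frontier, R) - closure
        closure |= frontier
    return closure


# Hypothetical chain of three R-steps: a -> b -> c -> d.
R = {("a", "b"), ("b", "c"), ("c", "d")}
R_squared = relative_product(R, R)   # pairs exactly two R-steps apart
R_po = power_closure(R)              # pairs any finite number of R-steps apart

print(sorted(R_squared))
print(sorted(R_po))
```

In this sketch y is reached from x "in some finite number of R-steps" exactly when the pair (x, y) belongs to `R_po`; the loop stops because a finite relation admits only finitely many such pairs.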

We are now ready to start the deriva- 
tion. It is necessary first to find in 
Hullian theory the relation evokes and 
the relation leads to. The first of these 
relations is easily found in postulate 11 
of Principles of Behavior (3). Postulate
11 asserts “The momentary effective
reaction potential (${}_S\dot{\bar{E}}_R$) must exceed
the reaction threshold (${}_SL_R$) before
a stimulus (S) will evoke a response
(R).” This postulate provides the relation
evokes, although the reader will
note below that we have taken only the 
word statement of the postulate. The 
symbolic statement given in Principles 
of Behavior is defective and we prefer
the sentence of assumption 5 to the one 
listed by Hull in his book. This postu- 
late 11 is now contained in the B part 
of the 14th postulate in the new postu- 
late set (4). 

The notion of a response leading to 
a stimulus complex is not so easily 
found. This notion does not appear 
explicitly in Hull’s theory, although it 
is introduced informally (and inciden- 
tally is assumed by many of the stu- 
dents of Hull—and students of students 
of Hull). For example, the diagram 
on page 96 of Principles of Behavior 
implies that response may lead to 
stimulation. Also in some of his earlier 
work, see especially (2), Hull assumes 
that response will lead to stimulation. 
At this point, then, two courses are 
open: (a) One may proceed on conven- 
tion and simply assume that the Hul- 
lian theory contains the notion of a re- 
sponse’s leading to stimulation. Or, 
(b) one may stay within the formal
theory and derive this notion by the 
method of substitution. The latter al- 
ternative has been chosen in this case 
for the reason that it is desirable to 
remain within the theory in the present 
derivation, and, despite the wide ac- 
ceptance of the convention mentioned 
in alternative (a), the acceptance of 
this convention goes beyond the formal 
theory. It was felt that it was better not 
to go beyond the theory in cases where 
it is desirable to show that the theory 
(the formal theory) contains the no- 
tions with which one wishes to work. 

Corollary XII in the chapter on Inhibition
and Effective Reaction Potential
asserts: “Whenever conditioned reactions
are evoked, whether reinforced
or not, reactive inhibition ($I_R$) is generated”
(p. 290). We shall substitute
in this corollary the notion of a stimu- 
lus complex for the notion of reac- 
tive inhibition, obtaining the following 
lemma: 
Lemma I. Whenever conditioned reactions
are evoked, whether reinforced
or not, stimulus complexes are gener- 
ated. We may now define the notion 
of a response leading to stimulation as 
equivalent to lemma I. 

Following is a list of the assumptions 
which are necessary for the derivation 
which follows: 


1. The logical type of the symbol which 
denotes reactive inhibition is the same as 
the logical type of the symbol which de- 
notes stimulus complex. This assumption 
is necessary for the justification of the 
substitution through which we arrived at 
lemma I from corollary XII. 

2. Assume an experimental situation such 
that the stimulus complexes may occur in 
serial order, and assume also a succession 
of trials in this experimental situation. The 
serial order may be temporal only, or it 
may be temporal and spatial. 




3. Assume that the effective reaction po- 
tential of responses with which we wish to 
deal is in excess of the reaction threshold 
in each case. 

4. Assume the notions of reinforcement 
and secondary reinforcement. 

5. Assume postulate 11. 

6. Assume corollary XII. 

7. Assume that if the ith stimulus evokes
a response, then this response is the ith response.

8. Assume that if the ith response leads
to a stimulus complex, then this is the (i + 1)st
stimulus complex.

Assumptions (7) and (8) specify prop- 
erties of the experimental situation. 
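The role of assumptions (7) and (8) can be seen in a small finite model. The sketch below is an illustration only, under stated assumptions: a hypothetical series of five stimulus complexes in which `s5` stands in for the incentive stimulus, relations stored as sets of pairs, and helper names (`relative_product`, `power_closure`) chosen for this example rather than taken from the paper.

```python
def relative_product(first, second):
    """Pairs (x, z) such that x `first` y and y `second` z for some middle term y."""
    return {(x, z) for (x, y) in first for (w, z) in second if y == w}


def power_closure(relation):
    """Union of all finite powers of a finite relation."""
    closure = set(relation)
    frontier = set(relation)
    while frontier:
        frontier = relative_product(frontier, relation) - closure
        closure |= frontier
    return closure


N = 5  # hypothetical length of the series; s5 plays the incentive stimulus

# Assumption 7: the ith stimulus complex evokes the ith response.
evokes = {(f"s{i}", f"r{i}") for i in range(1, N)}
# Assumption 8: the ith response leads to the (i + 1)st stimulus complex.
leads_to = {(f"r{i}", f"s{i + 1}") for i in range(1, N)}

one_step = relative_product(evokes, leads_to)  # links s_i to s_(i+1)
expectancy = power_closure(one_step)           # links s_i to every later s_k

print(sorted(one_step))
print(("s1", f"s{N}") in expectancy)
```

Under these assumptions the relative product of evokes and leads to joins each stimulus complex to its successor, and its power joins the first stimulus complex to the incentive stimulus through the intervening responses.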


We may now proceed directly to the 
derivation. Some of the assumptions 
will be listed below in symbolic lan- 
guage, although not all will enter di- 
rectly into the derivation. We let “j,” 
“k,” “l,” and so forth be variables for 
number signs, 


“$s_j$” denote the jth occurring stimulus complex in the experimental situation,
“$s_i$” denote the incentive stimulus, and the ith occurring stimulus complex,
“$r_j$” denote the jth occurring response complex in the experimental situation,
“$s_1$” denote the stimulus complex at the beginning of the experimental situation, and
the 1st occurring stimulus complex,
“$s_j \varphi r_j$” denote “the stimulus complex $s_j$ evokes the response $r_j$,”
“$r_j \psi s_k$” denote “the response $r_j$ leads to the stimulus complex $s_k$.”

The symbolic statements of the necessary assumptions follow:

Assumption 3.  $(j, k) \; s_j E r_k > s_j L r_k$
Assumption 5.  $s_j E r_j > s_j L r_j \,.\supset.\, s_j \varphi r_j$
Assumption 6.  $(x) : (\exists y) \,.\, x \in R \,.\, y \in I_R \,.\supset.\, Ex \supset Gy$
Assumption 7.  $(j, k) \,.\, s_j \varphi r_k \supset j = k$
Assumption 8.  $(j, k) \,.\, r_j \psi s_k \supset k = j + 1$

Derivation:

Postulate 11:  $s_j E r_j > s_j L r_j \,.\supset.\, s_j \varphi r_j$  (1)
Assumption 3. (Spec.):  $s_j E r_j > s_j L r_j$  (2)
(1). (2). Modus Ponens:  $s_j \varphi r_j$  (3)
Corollary 12:  $(x) : (\exists y) \,.\, x \in R \,.\, y \in I_R \,.\supset.\, Ex \supset Gy$  (4)
(4). $[S/I_R,\; z/y,\; \hat{u}(Gu)/G]$:  $(x) : (\exists z) \,.\, x \in R \,.\, z \in S \,.\supset.\, Ex \supset \hat{u}(Gu)z$  (5)
(5). (Concretion):  $(x) : (\exists z) \,.\, x \in R \,.\, z \in S \,.\supset.\, Ex \supset Gz$  (6)
Define:  $r_j \psi s_i =_{\mathrm{Df}} (x) : (\exists z) \,.\, x \in R_j \,.\, z \in S_i \,.\supset.\, Ex \supset Gz$  (7)
(6). (7):  $r_j \psi s_i$  (8)
(3). (8):  $s_j \varphi r_j \,.\, r_j \psi s_i$  (9)
(9). $[j/i \,.\, k/j]$:  $s_k \varphi r_k \,.\, r_k \psi s_j$  (10)
(9). $[1/j \,.\, k/i]$:  $s_1 \varphi r_1 \,.\, r_1 \psi s_k$  (11)
(9). [substitution illegible in the source]:  [statement illegible in the source]  (12)
(12). (11). (10). (9). (Assumption 2):  $s_1 \varphi r_1 \,.\, r_1 \psi s_k \,.\, \ldots \,.\, s_k \varphi r_k \,.\, r_k \psi s_j \,.\, s_j \varphi r_j \,.\, r_j \psi s_i$  (13)
Define²:  $(\varphi|\psi) =_{\mathrm{Df}} \hat{s}_m \hat{s}_n \{(\exists r_m) : s_m \varphi r_m \,.\, r_m \psi s_n\}$  (14)
(13). (14):  $s_1 (\varphi|\psi) s_k \,.\, \ldots \,.\, s_k (\varphi|\psi) s_j \,.\, s_j (\varphi|\psi) s_i$  (15)
(15).:  $s_1 (\varphi|\psi)_{po} s_i$  (16)


2 This is the definition of relative product
which appears in Principia Mathematica (6).
The symbolization has been modified to suit
the purposes of the present usage.


(16) is a relation between the relation
evokes and the relation leads
to, i.e., (16), which is the relative
product of these two relations, is the
expectancy. It will be noted that (15)
is just as good as (16), indeed, it is
a good deal more expressive. One
may think of the process of expectancy
development as a process which results
in bringing more stimulus complexes
into a particular relation with the goal
stimulus (incentive stimulus). The expectancy,
as it is developed in the present
case, is a relation mediated by response,
taking stimulus complexes as
arguments.

In the quotation from Tolman in
the second paragraph of this article the
following phrase appears: “In the case
of a sign-gestalt-expectancy the organism
is ready either as a result of perception
(q.v.), mnemonization (q.v.), or
inference (q.v.) to have some sort of
commerce ....” Each of these three
terms (perception, mnemonization, and
inference) designates a mode of sign-gestalt-expectation.
The present writer
takes the liberty of assuming that in 
any actual situation all three modes are 
active. It is assumed further that Tol- 
man is here referring to a completed ex- 
pectancy, such as might be similar to 
our (16). According to lemma I, it is 
reasonable to assume that during the 
development of the expectancy the 
mode of inference plays a very impor- 
tant part. 

At this point, a few remarks about 
lemma I might well be in order. It is
obvious that this lemma depends upon 
corollary 12. If it should ever become 
necessary to drop corollary 12 from the 
theory, then lemma I would be elimi- 
nated. In such a case, if it were de- 
sirable to keep the present notion of 
expectancy, it would be necessary to 
make the statement that responses lead 
to stimulation a part of the formal 
theory. Such a statement in the form 
of a postulate would be of great use to 
a theory like Hull’s. For example,
such an assumption would allow the 
derivation as a corollary of the postulate
of behavioral oscillation (postulate
10) (3).

The relationship between the notion 
of expectancy, as here presented, and 
the rest of Hullian theory must now be 
considered. Every useful construct must 
have an operational definition and must 
be related to other constructs in the 
theory of which it is a part. The 
beauty of such a derivation as that 
given above is that it necessitates no 
new additions to the list of operational 
definitions in a theory. The derivation 
provides the necessary reduction chain 
to reduce the new concept (1). The 
reduction basis for expectancy in Hullian
theory will be the same as that of
${}_SE_R$. The derivation also provides the
necessary connections with other constructs
in the theory. Because the
construct expectancy springs from ${}_SE_R$,
it will automatically become connected
with all of the constructs in Hull’s
theory which contribute to ${}_SE_R$.

The present paper will close with a 
remark about the importance of the 
possibility of such a derivation as this 
paper contains for one of the traditional 
areas of friction in psychology. Here 
reference is made to the molecular vs. 
molar controversy. These two terms 
refer to relative differences between 
levels of analysis and not to any real
fixed differences between approaches 
to the subject matter of psychology. 
Those individuals who begin with a 
relatively molar analysis of the phe- 
nomena of psychology end up with a 
relatively molar analysis of the phe- 
nomena they study. Those individuals 
who begin with a relatively molecular 
analysis of the phenomena of psychol- 
ogy may, if they utilize the proper 
techniques, also have a relatively molar 
analysis of the phenomena they study. 
The molecularites can have the molar 
cake and eat it too. 
In 1925 Tolman (20, p. 290) described
a peculiar regularity in the behavior
of rats that was later labeled
spontaneous alternation.


A simple T maze was used, and it was 
arranged so that the animal could get back 
to the food box in identical fashion whether 
he chose the left or the right angle of the 
T. Either route met with success... . 
[There was] a very pronounced tendency 
toward continuous and regular alternation 
—left, right, left, right, or right, left, right, 
left. . . . It appeared, in short, that even 
where either side was equally “satisfactory” 
there was in our rats a positive tendency 
left over toward variation of response . . . 
a positive tendency in and of itself. 


Ten years after Tolman’s report, 
spontaneous alternation received inten- 
sive study by Dennis and his co- 
workers. Although these early investi- 
gators did not incorporate their data 
into a formal theory, Dennis (5) did 
imply that spontaneous alternation was 
in some way determined by the stimuli 
the subject faced. For example, the 
title of one article was “Spontaneous 
Alternation in Rats as an Indicator of 
the Persistence of Stimulus Effects,” 
and, in discussion of his results, he 
wrote about “a tendency to avoid a 
specific pathway which has recently 
been traversed” (5, p. 310). This for- 
mulation suggests a theory of spon- 
taneous alternation in terms of the 


1 This paper is based upon a Ph.D. disserta- 
tion submitted in February 1952 to the de- 
partment of psychology at the University of 
Michigan. The writer is indebted to the mem- 
bers of the committee and to Dr. Edward L. 
Walker, chairman of the committee, who gave 
invaluable aid in the preparation and editing 
of this paper. 


stimulus. The rat alternates from one 
alley (a set of stimuli) to the other (a 
different set of stimuli), going to Alley 
B because it has already experienced 
Alley A. The theory proposed in this 
paper develops and systematizes this 
basic approach. 

In direct antithesis to a stimulus- 
oriented theory is one which interprets 
spontaneous alternation in terms of the 
response. In a response theory, the 
animal is viewed as alternating re- 
sponses instead of alleys, i.e., making a 
right turn because it has already made 
a left turn. Many of the more recent 


investigations examined below have 
stemmed from this theoretical position. 


A CRITIQUE OF RESPONSE THEORIES 


Explanation of spontaneous alterna- 
tion in terms of the response has been 
attempted by several Hull-influenced 
investigators: Heathers (9) using the 
concept of performance decrement; and 
Solomon (19) and Zeaman and House 
(23) using the concept of reactive in- 
hibition. According to both concepts, 
the occurrence of a response reduces 
temporarily the probability of its re- 
currence. The concepts were employed 
to explain both the fact of spontaneous 
alternation and the fact, predicted and 
demonstrated by Heathers (9), that 
spontaneous alternation disappears as 
the interval between trials increases. 

The concept of reactive inhibition, in 
addition, assigns an explicit role to the 
number of times the response is made 
and the amount of work required by 
the response. Thus, Zeaman and House 
(23) predicted and found that giving 
animals forced trials to one of two 
alleys increased the tendency to alternate
to the other; Solomon (19) predicted
and found some positive but not
conclusive evidence that increased work 
resulted in increased spontaneous al- 
ternation. 

Although these experiments seem to 
give considerable support to an expla- 
nation in terms of the response, a num- 
ber of facts remain that resist explana- 
tion by this type of theory. Typical 
of these are the following: 

1. Dennis (5) found that rats did 
not show spontaneous alternation from 
unit to unit of a multiple-unit maze. 
That is, they did not go right-left- 
right-left within a single trial on a four- 
unit maze, but showed instead spon- 
taneous alternation in their successive 
choices at a given choice point. They 
might go right-right-right-left on the 
first trial and left-left-left-right on the 
second trial. This finding is in direct 
contradiction to the prediction which 
should follow from a theory in terms 
of the response. 

2. Jackson (11) found that the dif- 
ference between the two possible re- 
sponses at the choice point did not af- 
fect the amount of spontaneous alter- 
nation. A Y maze with a small angle 
of separation, requiring a choice be- 
tween two practically identical re- 
sponses, elicited as much spontaneous 
alternation as a Y make with a wide 
angle of separation. If the response 
were the key to spontaneous alterna- 
tion, then differentiating the response 
should result in more alternation. 

3. Zeaman and House (23), using a 
procedure of ten forced trials followed 
by a free trial, and Walker,² using
simply two free trials, found that rats 
showed spontaneous alternation with 
intervals greater than an hour between 
trials. These findings disagree with 


2 Walker, E. L. Unpublished data. 
expectations on the basis of reactive inhibition,
which is ordinarily assumed
to dissipate within a few minutes.³

4. Wingfield (21) found that human 
subjects showed more spontaneous al- 
ternation in choice of lights that differed 
in color than of lights that were the 
same. Something in addition to con- 
sideration of the subject’s responses 
seems to be demanded by this finding. 

5. Dennis (5) varied the amount of 
work for rats running a maze by in- 
creasing the length of the final section 
from zero to 38 ft. but found no change 
in the amount of spontaneous alterna- 
tion. It would seem reasonable to ex- 
pect that the reactive inhibition from 
the added run of 38 ft. should dwarf 
the effects of reactive inhibition from 
the momentary act of turning right or 
left. 

These difficulties suggest that it 
might be fruitful to turn to the con- 
struction of an explanation in terms of 
the stimulus. As already pointed out,
Dennis (5) anticipated this type of explanation.
Recently Montgomery (16,
17, 18), on the basis of empirical find- 
ings of his own, has questioned the 
reactive-inhibition explanation of spon- 
taneous alternation and proposes the 
concept of an exploratory tendency 
which is reduced by exposure to a path- 
way of the maze. Berlyne (2) has 
presented a similar construct, a curios- 
ity drive, designed to explain curiosity 
in rats. (Working toward a very dif- 
ferent goal, that of incorporating per- 
ception and attention as “response” in 
Hullian theory, Berlyne (3) has also 
sketched the outlines of a theory that 
displays some points of similarity to the 
formulation presented below.) 


3 Reactive inhibition has to be assumed to
disappear quickly in order that it meet its 
other theoretical commitments. (See Zeaman 
and House [23].) 
A THEORY IN TERMS OF THE STIMULUS


We shall assume that, with continued 
exposure to an environment—to the 
same stimuli—the organism becomes 
less active in that environment. With 
an eye to several experiments (5, 9, 19, 
23), we assign a specific role to time in 
the disappearance of this boredom-like 
effect. Since, moreover, the explanation 
is in terms of the stimulus, it would be 
expected that “boredom” created in one 
situation would carry over to other simi- 
lar environments and that the more 
similar two environments are, the greater 
the carryover. 

Using these ideas to form the construct
of stimulus satiation,⁴ we shall
follow Hull’s procedure (10) and pre- 
sent the basis of the theory in the form 
of a postulate. 

This new postulate may not only en- 
able us to circumvent the difficulties of 
the reactive-inhibition explanation but 
will also yield novel predictions. Fur- 
thermore, it may serve to interrelate 
and explain empirical results from a 
number of apparently unrelated areas. 

The postulate reads as follows: 


Each moment an organism perceives a 
stimulus-object or stimulus-objects, A, there 
develops a quantity of stimulus satiation 
to A. 

i. The same amount of stimulus satiation 
develops in each successive moment. The 
total amount developed is, therefore, an in- 
creasing linear function of time. 

ii. There is loss of part of each quantity 
of stimulus satiation in each successive mo- 
ment. The amount of stimulus satiation 
remaining from each quantity is a decreas- 
ing negative exponential function of time. 

iii. Stimulus satiation developed to A 
will be generalized to other stimulus-ob- 
jects B. The amount of generalized stimu- 
lus satiation is an inverse function of the 
discriminability of A and B. 

4 This term is suggested by Karsten’s work
(12).
iv. The various quantities of stimulus
satiation combine additively. 

v. Stimulus satiation reduces the organ- 
ism’s tendency to make any response to A. 

Definition 1. Generalization. When a 
change in behavior toward A results in a 
similar change in behavior toward B, all 
other things being equal, generalization has 
taken place from A to B. 

Definition 2. Discriminability. The ease 
with which a subject can be induced to 
make a different response to A than to B, 
all other things being equal, is called the 
discriminability of A and B. 


Two corollaries follow from the 
postulate: 

Corollary I. As long as an organism re- 
mains perceiving A, the amount of stimulus 
satiation it has to A at a given moment 
(that is, the total amount developed [i] 
minus the total amount lost [ii]), is an in- 
creasing negative exponential function of 
time.

Corollary II. When an organism stops 
perceiving A, the amount of stimulus satia- 
tion it has to A at a given moment is a de- 
creasing negative exponential function of 
time. 


These corollaries are presented in Table 
1 and Fig. 1 with arbitrary values as- 
signed to the functions. 
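The arithmetic behind the two corollaries can be sketched in a few lines. This is a minimal illustration under the arbitrary values the text mentions (100 units of stimulus satiation developed per moment, each quantity decaying as $100e^{-n}$); the function names and the choice of four moments of exposure followed by four moments of absence are assumptions made for the example.

```python
import math


def quantity_remaining(initial, moments_elapsed, decay_rate=1.0):
    """Part ii: what is left of one quantity after a number of moments."""
    return initial * math.exp(-decay_rate * moments_elapsed)


def total_satiation(moment, moments_present, per_moment=100.0, decay_rate=1.0):
    """Parts i and iv: sum of all quantities still present at `moment` (1-based).

    A new quantity develops in each of the moments 1..moments_present,
    i.e., only while the stimulus-object is being perceived.
    """
    total = 0.0
    for born in range(1, min(moment, moments_present) + 1):
        total += quantity_remaining(per_moment, moment - born, decay_rate)
    return total


# Stimulus present during moments 1-4, absent during moments 5-8.
totals = [round(total_satiation(m, moments_present=4), 1) for m in range(1, 9)]
print(totals)
```

While the stimulus is present the totals rise by ever smaller increments (Corollary I); once it is removed each total is the previous one multiplied by the same decay factor (Corollary II).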

Given the characteristics of various 
situations, the postulate yields a large 


number of deductions. A number of 
these for some basic types of situations 
are formulated and discussed below. 
For each deduction discussed, we will
indicate whether or not it is crucial for
deciding between the two theories, i.e., one in terms of
the stimulus as against one in terms of
the response, and state the nature
of any existing evidence.


DEDUCTIONS FOR APPROACH RESPONSES
IN A TWO-ALTERNATIVE SITUATION

Spontaneous Alternation


Let us start with a very simple situation.
This situation is one in which S
has evenly spaced, consecutive trials in 





MURRAY GLANZER


TABLE 1

COROLLARIES I AND II ILLUSTRATED BY OBTAINING VALUES FOR STIMULUS SATIATION,
$I_n = 100e^{-n}$ (WHERE n IS THE NUMBER OF MOMENTS SINCE
THE APPEARANCE OF THE STIMULUS)*

Stimulus                               Moments
Satiation          Stimulus Present               Stimulus Absent
Quantity        1       2       3       4       5       6       7       8

1             100.0    36.8    13.5     5.0     1.8     0.7     0.2     0.1
2                     100.0    36.8    13.5     5.0     1.8     0.7     0.2
3                             100.0    36.8    13.5     5.0     1.8     0.7
4                                     100.0    36.8    13.5     5.0     1.8

Total         100.0   136.8   150.3   155.3    57.1    21.0     7.7     2.8

* Each row gives the history of a single quantity of stimulus satiation as it decreases with time.
Each column gives the total amounts of stimulus satiation present during a single moment.


an apparatus that contains two distinct 
areas, such as a single-unit T or Y 
maze. A trial starts with S’s entrance 
into the apparatus and ends with its 
exit from the apparatus. 

In order to complete a trial S has to 
enter one of the two areas or alleys. 
These areas should be initially more 
or less equally attractive to S. Dur- 
ing the course of the experiment, fur- 
thermore, one area should not be made 
more or less attractive than the other 







by introducing reward or punishment. 
A situation that has these characteris- 
tics will be called a simple two-alterna- 
tive situation. 

The alternatives correspond to stimu- 
lus-objects A and B of the postulate. 
The S approaches and perceives one of 
the two on each trial. It will be as- 
sumed that S, upon entering an alterna- 
tive, “perceives” it. 

The S, perceiving one alternative, 
builds up stimulus satiation to it that 


FIG. 1. Total amount of stimulus satiation, $\Sigma I_n$, as a function of time and the presence or
absence of the stimulus-object. The totals of the columns of Table 1 are represented by this
figure. (Abscissa: time; panels: stimulus present, stimulus absent.)







reduces the tendency to respond to that 
alternative. The response considered 


here is the response of approach, since 
E usually records and requires only 
that response for completion of a trial. 


Deduction 1. In the simple two-alterna- 
tive situation, the subject will alternate 
choice of arms or alleys in successive trials. 

Verified; not crucial. 


This is the simple fact of spontaneous 
alternation which has been repeatedly 
demonstrated (4, 5, 9, 11, 19, 20, 22, 
23). 


Deduction 2. In the simple two-alterna- 
tive situation, if there is an exchange be- 
tween trials of stimulus-objects so that the 
cues that were on one side of the S now 
are on the other, and vice versa, S will al- 
ternate approach to stimulus-objects in suc- 
cessive trials. This will mean a repetition 
of responses rather than an alternation of 
responses (turns). 

Verified; crucial. 


This is a key deduction since it contrasts
the stimulus satiation and reactive-inhibition
explanations most sharply.
Stimulus satiation theory predicts a re- 
duction in alternation of response, 
whereas a theory in terms of the re- 
sponse predicts no decrease. 

The author (8) ran rats in a cross- 
shaped maze with two starting boxes 
(north and south), making it possible for
the Ss to start from one starting box for 
one trial and from the opposite starting 
box for the next. When this was done, 
the position of the alleys relative to S 
was reversed: the alley that had been on 
its right during Trial 1 was on its left 
for Trial 2. By thus shifting stimuli, it 
could be determined whether the ani- 
mals alternated responses as required 
by the reactive-inhibition explanation 
or alleys as required by the stimulus 
satiation explanation. 

Twenty-six rats were given two im- 
mediately consecutive trials a day for 
eight days. On four of the days both 
trials were from the same starting box;
on the other four days the trials were 
from opposite starting boxes. As predicted
above, the animals tended to repeat
(i.e., go right-right or left-left)
rather than alternate their responses on 
the days that the cues were shifted. 
This indicates clearly that they were 
alternating alleys (stimuli) rather than 
turns (responses). A similar experi- 
ment has been described by Mont- 
gomery (18) who reports results con- 
sistent with these findings. 


Deduction 3. In the simple two-alterna- 
tive situation if an extended series of con- 
secutive trials is given, the tendency to- 
ward spontaneous alternation will be at a 
maximum for the first pair of trials, and 
will decrease to a minimum in later trials. 
This decrease will be called cumulation- 
effect. 

Tested but not clearly verified; not
crucial.


In a series of consecutive trials in a 
two-alternative situation, if the first 
trial was to alternative A, then the sec- 
ond trial would be to alternative B (De- 
duction 1). There would be two stimu- 
lus satiation curves then: one for alter- 
native A starting from the first trial, 
and continuing its rise through later 
trials to alternative A; the other for 
alternative B, starting from the second 
trial. Since, according to Corollary I, 
curves of stimulus satiation are nega- 
tively accelerated, the two curves would 
approach each other in the later trials. 
The difference in amount of stimulus 
satiation to the two alternatives would 
therefore decrease as illustrated in 
Fig. 2, leading to the predicted decrease 
in spontaneous alternation. 

Although results from Wingfield and 
Dennis (22), Heathers (9) and the au- 
thor (7) show the general trend to- 
ward decrease in spontaneous alterna- 
tion from first to last trial in a series, 
the data also show reversals and con- 
siderable irregularity. 














FIG. 2. Total amount of stimulus satiation, $\Sigma I_n$, to two alternatives, A and B, in successive
alternating trials. The curves, approaching their limits, cause a decrease in the difference (the
distance between the curves) in the amount of stimulus satiation to the two alternatives. All
odd-numbered trials are to A, even-numbered to B. Decreases occurring during intertrial intervals
have not been drawn. (Abscissa: end of trial, 1 through 6.)


The inconclusive nature of these
data may result from the very procedure
indicated by Deduction 3. In
this procedure the animal is free to
choose either alternative each trial in
the series. The E therefore soon loses
control of the basic variable, the number
and order of particular choices
that precede a given trial.

In order to make an adequate test of 
the prediction, it may be necessary 
either to use a group of animals large 
enough to cancel individual differences 
in the pattern of choices or to change 
the procedure in order to control the 
pattern of choices throughout. Clearer 
results would be expected, then, for the 
following deduction. 


Deduction 4. In the simple two-alterna- 
tive situation, if a number of forced con- 
secutive alternation trials is given, as the 
number of trials increases, the tendency 
toward spontaneous alternation decreases. 

Not tested; not crucial. 


Now we turn to consideration of the 
role of generalization. On the basis of 
part iii of the postulate, the prediction 
could be made that the more discrimi- 
nable the alternatives, the greater the 
amount of spontaneous alternation. 
Discriminability can be varied by sev- 
eral methods. One is to eliminate or 
add cues that differentiate the two 
alternatives. 
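Part iii of the postulate says only that generalized stimulus satiation is an inverse function of discriminability; it does not give the function. The sketch below therefore rests on a loudly hypothetical choice, generalization = 1 / (1 + discriminability), with discriminability on an arbitrary scale, simply to show the direction of the prediction. The function names and the value of 100 units of satiation to A are likewise assumptions made for the example.

```python
def generalization(discriminability):
    """Hypothetical inverse function: 1.0 when A and B cannot be told apart,
    falling toward 0.0 as they become easier to discriminate."""
    return 1.0 / (1.0 + discriminability)


def satiation_gap(satiation_to_a, discriminability):
    """Satiation to A minus the portion of it generalized to B (parts iii and iv)."""
    return satiation_to_a * (1.0 - generalization(discriminability))


# Arbitrary discriminability values, from identical (0) to highly distinct (9).
gaps = [round(satiation_gap(100.0, d), 1) for d in (0.0, 1.0, 3.0, 9.0)]
print(gaps)
```

Whatever inverse function is chosen, the gap between the satiation to A and the satiation generalized to B grows as the alternatives become more discriminable, which is the sense in which more spontaneous alternation is predicted for more distinct alternatives.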


Deduction 5. In the simple two-alterna- 
tive situation, as stimuli differentiating the 
alternatives are eliminated, spontaneous al- 
ternation will decrease. 

Tested but not verified; crucial. 


A study on this by the author (7) 
comparing spontaneous alternation in 
Y mazes with many and few differen- 
tiating cues yielded negative results. 
In that experiment, however, there may 
have been inadequate control of the 
differentiating cues, since only intra- 
maze cues were varied. 
Since discriminability is a function of
the organism as well as the environ- 
ment, the following deductions can also 
be derived. 


Deduction 6. In the simple two-alterna- 
tive situation, if the ability of the organism 
to discriminate is reduced, spontaneous al- 
ternation will decrease. 


Of the conditions of the organism 
that may affect its ability to discrimi- 
nate, two may be singled out on the 
basis of empirical studies. One is sense- 
organ damage and the other is brain- 
injury (15). If these conditions may 
be considered to reduce discriminability, 
then the following more specific deductions
may be made.⁵


Deduction 6a. In the simple two-alterna- 
tive situation, the greater the extent of S’s 
sense-organ damage, the less the amount of 
spontaneous alternation. 

Not tested; crucial. 

Deduction 6b. In the simple two-alter- 
native situation, the greater the extent of 
S’s brain damage, the less the amount of 
spontaneous alternation. 

Verified; crucial. 


Certain of Krechevsky’s (13, 14) re- 
sults support this deduction. He found 
that brain-operated rats perseverated in 
choice of one of two paths much more 
than did normal rats. His situation de- 
scribed below was somewhat compli- 
cated but the basic two-alternative ar- 
rangement was employed. 

Now we turn to the effects of time 
on spontaneous alternation. 


Deduction 7. In the simple two-alter- 
native situation, as the interval between 
trials increases, the amount of spontaneous 
alternation decreases. 

Verified; not crucial. 


This deduction follows simply from 
part ii of the postulate. It has been 


5 Strictly speaking, Deductions 6a, 6b, 15 
hold only for situations in which there is em- 
pirical support for the stated relationship be- 
tween brain or receptor injury and discrimi- 
nability. 
verified by Heathers (9) who found
that, as the time interval between trials 
increased from 15 sec. to 15 min., spon- 
taneous alternation decreased from 86 
to 50 per cent (chance). It is also 
verified by Zeaman and House (23), 
who found a decrease over a period of 
24 hr. 


Deduction 8. In the simple two-alterna- 
tive situation, the more highly differenti- 
ated the alternatives, the longer the time 
interval between trials necessary for the 
disappearance of spontaneous alternation. 

Not tested; crucial. 


This may, in part, explain the widely
discrepant data that have been obtained
concerning the effects of delay
on spontaneous alternation. For example,
Walker,⁶ using a highly differentiated
Y maze, found spontaneous
alternation with intervals longer than
an hour. Montgomery (16), who took
special care to keep his alternatives
alike, found that spontaneous alternation
disappeared with intervals as short
as 90 sec.

Now we turn to an implication of 
Corollary I, that the longer the time the 
organism perceives a_ stimulus-object 
A, the greater the amount of stimulus 
satiation to A. 


Deduction 9. In the simple two-alterna- 
tive situation, if S is detained in one alter- 
native, the amount of spontaneous alterna- 
tion to the other alternative on the follow- 
ing trial will be greater, the longer the time 
of detention. 

Not tested; crucial. 


The apparent contradiction between
Deduction 9 and Deduction 7 may be
resolved in the following way. In
Deduction 7, the concern is with “interval
between trials,” an interval spent outside
the alternative chosen, whereas in
Deduction 9, the concern is with a time
interval spent within the alternative.
These distinctions can be kept clear in
the formulation of a new deduction in
terms of the interval between choices.

6 See footnote 2.


Deduction 9a. In the simple two-alter- 
native situation, with the time interval be- 
tween choices constant, the greater the pro- 
portion of time between choices spent in 
the presence of the chosen alternative, the 
greater the amount of spontaneous alterna- 
tion on the succeeding trial. 


Deductions 7 and 9 can now be com- 
bined in the following. 


Deduction 10. In the simple two-alter- 
native situation, if S is delayed between 
choices and this delay occurs within the 
alternative last chosen, S will show more 
spontaneous alternation than under a con- 
dition of no delay. If, however, S is de- 
layed between choices and this delay occurs 
outside the alternative last chosen, S will 
show less spontaneous alternation than un- 
der a condition of no delay. 

Verified; crucial. 


The author (8) found that when rats 
were detained in the end box of the al- 
ternative chosen in the first run in a 
T maze, 96 per cent of the group 
showed spontaneous alternation on the 
following trial. When they were kept 
in other parts of the maze at the end 
of their first run, between 36 and 68 per 
cent of the group showed spontaneous 
alternation. When they were not de- 
tained, spontaneous alternation varied 
between 72 and 88 per cent. 
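The qualitative ordering in Deduction 10 can be illustrated with a toy model. The sketch below is not the author's formalization of the postulate: the exponential growth and decay of satiation, the logistic choice rule, and every parameter value (`GROWTH`, `DECAY`, `SENSITIVITY`, and the three durations) are hypothetical assumptions chosen only to show how detention inside the chosen alternative and delay outside it could pull alternation in opposite directions.

```python
import math

# Hypothetical parameters; none of these values come from the article.
GROWTH = 0.05       # rate at which satiation builds inside the alternative (per sec)
DECAY = 0.01        # rate at which satiation fades outside the alternative (per sec)
SENSITIVITY = 4.0   # how strongly satiation biases the next choice


def satiation(time_inside: float, time_outside: float) -> float:
    """Toy satiation level to the last-chosen alternative at the next choice."""
    built = 1.0 - math.exp(-GROWTH * time_inside)    # grows with exposure
    return built * math.exp(-DECAY * time_outside)   # fades with time away


def p_alternate(level: float) -> float:
    """Toy probability of choosing the other alternative; 0.5 when level is 0."""
    return 1.0 / (1.0 + math.exp(-SENSITIVITY * level))


# Assumed durations in seconds: a brief visit, the return to the start, a long delay.
VISIT, RETURN, DELAY = 10.0, 15.0, 600.0

no_delay = p_alternate(satiation(VISIT, RETURN))
delay_inside = p_alternate(satiation(VISIT + DELAY, RETURN))
delay_outside = p_alternate(satiation(VISIT, RETURN + DELAY))

print(f"delay inside : {delay_inside:.2f}")
print(f"no delay     : {no_delay:.2f}")
print(f"delay outside: {delay_outside:.2f}")
```

Under these assumptions the three conditions fall in the order stated in Deduction 10; the numerical values themselves carry no empirical weight.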


Deduction 11. In the simple two-alter- 
native situation, if S is given forced runs 
to one alternative, the greater the number 
of such forced trials, the greater the 
amount of spontaneous alternation on a 
subsequent free-choice trial. 

Verified; not crucial. 


This procedure is basically the same 
as that involved in Deduction 9. In 
Deduction 11 the animal builds up 
stimulus satiation during repeated visits 
to one alternative; in Deduction 9 it 
does so during a period of detention 
within the alternative. Zeaman and
House, deriving their hypothesis from
the reactive-inhibition postulate,
verified Deduction 11.

The amount of spontaneous alterna- 
tion obtained by each of the two pro- 
cedures, by forced trials and by deten- 
tion at the alternative, may now be 
compared to yield a clearly critical 
deduction. 


Deduction 12. In the simple two-alter- 
native situation, detention of S for a given 
period of time at the alternative last chosen 
will yield larger amounts of spontaneous 
alternation than will a series of forced 
trials to one alternative over the same pe- 
riod of time. 

Not tested; crucial. 


In the forced-trial procedure, as com- 
pared with the detention procedure, S, 
being taken from end box to starting 
box, spends a smaller proportion of the 
time within the alternative itself. Thus 
S has less time to develop stimulus 
satiation. 


Variability (Preference for Variable Situations)


An important variation of the two- 
alternative situation is one in which the 
cues of one alternative are varied while 
the cues of the other are kept constant. 
This type of situation has been investi- 
gated in two studies of “variability.” 
Krechevsky (13, 14) used a two-alter- 
native maze, both arms of which con- 
tained a number of turns. One arm 
could be varied in pattern of turns from 
trial to trial. Both variable and con- 
stant arms were constructed so that 
there was an equal number of right and 
left turns within each arm. 

Three different maze arrangements 
were used: a long varying versus a 
short constant path (13); a short vary- 
ing versus a long constant path (14); 
a long constant versus a short constant 
path (14). In each of these Krechev- 
sky ran two groups of rats, one group 
normal and the other brain-operated.

In such a situation, stimulus satiation
to the varying alternative will be reduced
by its division among the different
forms of the varying alternative.
Essentially, a situation exists in which
the constant alternative A is opposed
by a family of alternatives $B_1, B_2, B_3,
\ldots, B_n$. Alternative A bears the weight
of all of the stimulus satiation built up
during an exposure to it; the family of
alternatives B divides stimulus satiation
among its variations.


Corollary III. If an alternative is varied 
from trial to trial, then less stimulus satia- 
tion will be built up to it than if no such 
variation occurs. 

Deduction 13. In the simple two-alter- 
native situation, if one of the alternatives 
is varied from trial to trial and the other 
is kept constant, then S will prefer the 
varied alternative. 

Verified; crucial. 


This is verified by Krechevsky (13, 
14). Examination of the results for 
his normal animals shows that the pre- 
dicted preference is strong enough to 
cause Ss to prefer the varied path even 
when it is the longer path.⁷

It follows that the division of stimu- 
lus satiation among the variations of 
the changing alternative depends on the 
degree of discriminability of the varia- 
tions. As the variations become less 
and less discriminable, the condition of 
the constant alternative, i.e., of no
change, is approached. 


Corollary IV. If an alternative is varied 
from trial to trial, the more discriminable 
the variations the less the amount of stimu- 
lus satiation that will be present for the 
alternative. 


7 Krechevsky’s findings cannot be explained 
in terms of reactive inhibition or any other 
theory in terms of the response since, as 
pointed out earlier, the mazes were designed 
so that each arm had the same number of 
right and left turns within it. The Ss would 
have, following each trial, an equal amount 
of reactive inhibition to right and left turning.

Two ways to reduce discriminability
were indicated earlier: reduction
of the number of distinguishing
cues of the perceived objects and
impairment of the sensory or nervous
systems of the Ss. Consideration of these
methods leads to the following
deductions.


Deduction 14. In the simple two-alter- 
native situation, the preference for a vary- 
ing alternative (as opposed to a constant 
alternative) will be greater, the greater the 
difference between variations. 

Not tested; crucial. 

Deduction 15. In the simple two-alter- 
native situation, brain-injured Ss will show 
less preference for a varying alternative 
(as opposed to a constant alternative) than 
will normal Ss. 

Verified; crucial. 


In Krechevsky’s studies (13, 14) the 
normal animals preferred the varying 
alternative more strongly than did the 
operated animals. 

Finally, it can be deduced that the 
greater the number of variations that 
share the stimulus satiation, the less 
the amount present to each. 


Deduction 16. In the simple two-alter- 
native situation, the greater the number of 
variations, the greater the preference for 
the varying alternative. 

Not tested; crucial. 


DEDUCTIONS FOR THE MULTIPLE- 
ALTERNATIVE SITUATION 


We may now generalize the deductions
to apply to situations with more
than two alternatives. For example,
let us apply the postulate to the behavior
of S in the three-alternative
situation. The S, after choosing one
alternative, will build up stimulus satiation
to it. Upon returning to the choice
point for a second trial, S will probably
choose a new alternative since responsiveness
to the first choice has been
reduced. If returned again to the
choice point, S will probably choose the
“unsatiated” third alternative.


Deduction 1’. In a simple multiple-al- 
ternative situation, S will choose many al- 
ternatives in a series of consecutive trials. 

Definition 3. Many alternatives mean 
more alternatives than could reasonably be 
expected on the basis of chance. 


Deduction 1’ is verified by the find- 
ings of Wingfield and Dennis (22) on 
the number of different paths chosen by 
rats running four trials on a four- 
alternative maze. 
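Definition 3 measures "many alternatives" against chance, so it helps to make the chance baseline explicit. The sketch below assumes that "chance" means independent, equiprobable choices on every trial (an assumption; the article does not spell out the null model) and computes the expected number of different alternatives chosen, both from the closed form and by brute-force enumeration. The function names are illustrative only.

```python
from fractions import Fraction
from itertools import product


def expected_distinct(n_alternatives: int, n_trials: int) -> Fraction:
    """Expected number of different alternatives chosen in n_trials
    independent, equiprobable choices among n_alternatives."""
    # Probability that one particular alternative is never chosen.
    p_never = Fraction(n_alternatives - 1, n_alternatives) ** n_trials
    return n_alternatives * (1 - p_never)


def expected_distinct_bruteforce(n_alternatives: int, n_trials: int) -> Fraction:
    """Same quantity, obtained by enumerating every possible choice sequence."""
    sequences = list(product(range(n_alternatives), repeat=n_trials))
    total = sum(len(set(seq)) for seq in sequences)
    return Fraction(total, len(sequences))


# Four trials on a four-alternative maze, as in the Wingfield and Dennis study.
chance_4x4 = expected_distinct(4, 4)
print(chance_4x4, float(chance_4x4))
```

Under this null model a rat running four trials on a four-alternative maze would be expected to enter 175/64 (about 2.73) different paths; an observed mean reliably above that value is what Deduction 1' predicts.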

Deduction 1 appears as simply a 
special case of Deduction 1’. In fact, 
a set of more general deductions can 
now be obtained in most cases merely 
by changing “two-alternative” to “mul- 
tiple-alternative” and “spontaneous al- 
ternation” to “tendency to choose many 
alternatives.” Each of the 16 earlier 
deductions becomes a special case of 
one of the new set of 16 deductions for 
multiple-alternative situations. 


Consideration of situations involving
more than two alternatives suggests a
new variable, the number of alternatives,
which can form the basis of several
new deductions. One such deduction,
of particular interest, concerns the
cumulation effect of Deductions 3 and
5 and its counterpart in multiple-alternative
situations, i.e., the decrease in
the tendency to choose many alternatives
in an extended series of trials.


Deduction 17’. If there are two simple 
multiple-alternative situations, and one of 
them has a greater number of alternatives 
than the other, then the cumulation effect 
will be smaller and will develop more slowly 
for the situation with a greater number of 
alternatives. 


Cumulation effect depends on the 
summation of amounts of stimulus satia- 
tion from successive visits to the same 
alternative. In the two-alternative
situation, S visits alternative A and then
B, then A again, then B again, rapidly
building up near-maximum amounts of
stimulus satiation to both alternatives.
According to Deduction 1’, the S, in a 
three-alternative situation, visits A then 
B then C and then A again, B again, 
and C again. The time between suc- 
cessive visits to the same alternative is 
longer, thereby delaying the onset of 
maximum stimulus satiation for any 
single alternative. It will consequently 
take more time to reach the period 
when differences in the amounts of 
stimulus satiation present for the vari- 
ous alternatives approach zero. 


DEDUCTIONS FOR “FREE” PROCEDURE 


Deductions have been drawn thus far
only for the procedure in which S
makes a choice and then is returned by
E to the starting point for the next
trial. Another procedure, which we
shall call the “free” procedure, is one
that allows S to make as many choices
as it will for a given period of time.
The S makes a choice and then returns
itself to the choice point for its next
choice. When employed without rewards,
this is the standard procedure in
the study of “exploratory behavior.”⁸

Exploratory behavior may now be 
derived from the stimulus satiation 
postulate. The deductions can be fur- 
ther generalized to cover this more 
complex procedure so that exploratory 
behavior, like spontaneous alternation, 
becomes a special case governed by the 
postulate. The word “choice” replaces 
the word “trial” in the deductions, since 
we are no longer concerned with a trial 
as defined earlier; and the procedure is 
no longer called “simple.” 

Most of the deductions have not been
tested in the “free” procedure. Those
for which there are relevant data have
been verified. For example, there is the
following deduction involving cumulation
effect.


8 The relationship between spontaneous
alternation and exploratory behavior was pointed
out by Tolman (20) who explained the former
in terms of the latter. We, however, view
both spontaneous alternation and exploratory
behavior as manifestations of a single
underlying factor.


SPONTANEOUS ALTERNATION AND RELATED PHENOMENA


Deduction 3”. In the multiple-alterna- 
tive situation, if an extended series of con- 
secutive choices is given, as the number of 
such choices increases, the tendency to 
choose many alternatives decreases. 


This has been verified by Dennis 
and Sollenberger (6) and later by 
Montgomery (17). Both studies found 
that rats enter fewer and fewer alleys 
in successive periods of free exploration. 

Another deduction for which there 
are data is the following: 


Deduction 17”. If there are two multi- 
ple-alternative situations and one of them 
has a greater number of alternatives than 
the other, then the cumulation effect will 
be smaller and develop more slowly for the 
situation with a greater number of alterna- 
tives. 


This has also been verified by Dennis 
and Sollenberger (6) and Montgomery 
(17). They found that larger mazes 
elicited more activity and continued to 
elicit activity over a longer period of 
time than did small mazes. Larger 
mazes are considered here to contain 
a greater number of alternatives. 


POSSIBLE APPLICATIONS OF THE THEORY
TO HUMAN SUBJECTS


Although most of the work in the 
areas under consideration has been car- 
ried out with rats as Ss, there is some 
evidence obtained from human Ss that 
is congruent with the postulate. 

The closest analogue to the two- 
alternative situation that elicits spon- 
taneous alternation in rats is found in a 
study by Wingfield (21) who required 
college students to turn on one of a 
pair of lights. In a series of four trials 
the Ss not only spontaneously alternated
the lights they chose to turn on
(Deduction 1) but also showed more
spontaneous alternation on the first 
pair than on the second pair of trials 
(Deduction 3) and more with lights of 
different hue than with lights of the 
same hue (Deduction 5). 

Karsten’s (12) work on the phenomenon
which she calls “satiation” (Sättigung)
is of particular interest. She
had her Ss perform repeatedly such
tasks as drawing lines or tapping.⁹
This activity produces repeated experience
of certain stimulus-objects, e.g.,
lines drawn, and can therefore be considered
a case covered by Deduction
13”. This deduction can be considered
to predict Karsten’s finding of the Ss’
resistance to continuing the task as well
as their variations from the prescribed
activity.¹⁰ The phenomenon of cosatiation
(transfer of satiation symptoms to
new and similar tasks) could also be
predicted from part iii of the postulate.

A further finding by Karsten suggests
the relevance of the postulate to
the problem of fatigue. Symptoms of
satiation, including the Ss’ inability to
move their arms, disappeared when the
grouping of the lines being drawn was
changed even though the required muscle
movements remained the same.
This recalls two things: part iii of the
postulate and the recent emphasis on
factors other than simple muscle states
in understanding the phenomenon of
fatigue (1).


SUMMARY 


Because the reactive-inhibition explanation
of spontaneous alternation in
rats seems inadequate, a new theory in
terms of the effects of continued exposure
to stimuli is proposed. This is
presented in the form of a postulate
with deductions drawn for various aspects
of the simple two-alternative situation
and available evidence for their
validity presented. Following this, the
theory is generalized to cover situations
with any number of alternatives, and
situations that require other procedures,
including one that elicits exploratory
behavior. Finally the pertinence
of the theory to some aspects
of human behavior is indicated. The
postulate of stimulus satiation yields
novel predictions and serves to unify
under a single explanatory construct
such diverse phenomena as spontaneous
alternation, exploratory behavior,
Krechevsky’s “variability,” and Karsten’s
“satiation.”


9 Berlyne (3) handles these data in the same
way in his expansion of Hull’s theory to
include perceptual “responses.”

10 The Ss’ resistance also expressed itself in
the form of inattentiveness and “fatigue.” All
of these symptoms could be predicted on the
basis of Corollary I.


REFERENCES 


1. Bartley, S. H. Fatigue and efficiency.
In H. Helson (Ed.), Theoretical foundations
of psychology. New York:
Van Nostrand, 1951. Pp. 318-348.

2. Berlyne, D. E. Novelty and curiosity as
determinants of exploratory behavior.
Brit. J. Psychol., 1950, 41, 68-80.

3. Berlyne, D. E. Attention, perception and
behavior theory. Psychol. Rev., 1951,
58, 137-146.

4. Dennis, W. A comparison of the rat’s
first and second exploration of a maze
unit. Amer. J. Psychol., 1935, 47, 488-490.

5. Dennis, W. Spontaneous alternation in
rats as an indicator of the persistence
of stimulus effects. J. comp. Psychol.,
1939, 28, 305-312.

6. Dennis, W., & Sollenberger, R. J. Negative
adaptation in the maze exploration
of albino rats. J. comp. Psychol., 1934,
18, 197-206.

7. Glanzer, M. Stimulus satiation as an
explanation of spontaneous alternation in
rats. Unpublished Ph.D. dissertation,
Univer. of Michigan, 1952.

8. Glanzer, M. The role of stimulus satiation
in spontaneous alternation. J. exp.
Psychol., 1953, 45, 387-393.

9. Heathers, G. L. The avoidance of repetition
of a maze reaction in the rat as a
function of the time interval between
trials. J. Psychol., 1940, 10, 359-380.


MURRAY GLANZER


10. Hull, C. L. Principles of behavior. New
York: D. Appleton-Century, 1943.

11. Jackson, M. M. Reactive tendencies of
the white rat in running and jumping
situations. J. comp. Psychol., 1941, 31,
255-262.

12. Karsten, A. Untersuchungen zur Handlungs-
und Affektpsychologie: V. Psychische
“Sättigung.” Psychol. Forsch.,
1928, 10, 142-154.

13. Krechevsky, I. Brain mechanisms and
variability: II. Variability where no
learning is involved. J. comp. Psychol.,
1937, 23, 139-163.

14. Krechevsky, I. Brain mechanisms and
variability: III. Limitations of the effect
of cortical injury upon variability.
J. comp. Psychol., 1937, 23, 351-364.

15. Lashley, K. S. The mechanism of vision.
II. The influence of cerebral lesions
upon the threshold of discrimination for
brightness in the rat. J. genet. Psychol.,
1930, 37, 461-480.

16. Montgomery, K. C. Spontaneous alternation
as a function of time between trials
and amount of work. J. exp. Psychol.,
1951, 42, 82-93.

17. Montgomery, K. C. Exploratory behavior
and its relation to spontaneous alternation
in a series of maze exposures.
J. comp. physiol. Psychol., 1952, 45,
50-57.

18. Montgomery, K. C. A test of two
explanations of spontaneous alternation.
J. comp. physiol. Psychol., 1952, 45,
287-294.

19. Solomon, R. L. The role of effort in the
production of several related behavior
phenomena. Unpublished Ph.D. dissertation,
Brown Univer., 1947.

20. Tolman, E. C. Purpose and cognition:
The determiners of animal learning.
Psychol. Rev., 1925, 32, 285-297.

21. Wingfield, R. C. Some factors influencing
spontaneous alternation in human
subjects. J. comp. Psychol., 1943, 35,
237-244.

22. Wingfield, R. C., & Dennis, W. The
dependency of the rat’s choice of pathways
upon the length of the daily trial
series. J. comp. Psychol., 1934, 18,
135-147.

23. Zeaman, D., & House, B. J. The growth
and decay of reactive inhibition as
measured by alternation behavior. J.
exp. Psychol., 1951, 41, 177-186.


[MS. received July 2, 1952] 





Psychological Review 
Vol. 60, No. 4, 1953 


THE BACKWARD CURVE: A METHOD FOR THE STUDY 
OF LEARNING 


KEITH J. HAYES 


Yerkes Laboratories of Primate Biology, Orange Park, Florida 


Graphical representation of the tem- 
poral course of improvement in per- 
formance has long been a popular re- 
search tool in the field of learning 
theory. Such curves have often been 
presented as evidence for basic mecha- 
nisms presumed to be involved in learn- 
ing. Potentially, this type of analysis 
is important: If it can be shown that 
the acquisition of simple associations 
is practically always a sigmoid (or 
linear, or discontinuous, etc.) function 
of training, then one theory may gain 
considerably more than another in pre- 
sumptive validity. 

Unfortunately, the type of curve
commonly used for this purpose indicates
the average performance of a
group of Ss at successive stages of practice.
These curves are quite irrelevant
to basic problems of learning theory,
since their forms are determined not
only by the forms of their component
individual curves, but also by the
distributions of the individual curves. The
influence of this second factor is
illustrated in Fig. 1, where it is seen that
sigmoid average curves tend to result
from symmetrical distributions of various
kinds of individual curves.¹ When
the distribution of individual curves has
a strong positive skew, the resulting
average curve tends toward predominantly
negative acceleration.


1 Normally distributed sigmoid individual
curves will give a similar average curve; but
Merrell (3) has shown (in the context of
logistic growth curves) that the average curve
need not have the same form as its components.
The average curve may not be expressible
as $y = K/(1 + e^{a+bx})$, for instance, even
though all of the individual curves are.
Sidman (5) has made this same point with
reference to the negatively accelerated function,
$y = M - Me^{-kx}$.
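The dependence of the average curve on the distribution of individual curves can be checked numerically. The sketch below is only an illustration under stated assumptions: each hypothetical subject is given a pure step curve (chance performance until an onset trial, perfect performance afterwards), and the onset trials are drawn from either a roughly normal or an exponential (positively skewed) distribution. The group size, the distribution parameters, and all function names are invented for the example.

```python
import random
import statistics


def step_curve(onset, n_trials, low=0.5, high=1.0):
    """Discontinuous individual curve: `low` before `onset`, `high` from then on."""
    return [low if t < onset else high for t in range(n_trials)]


def average_curve(curves):
    """Trial-by-trial mean over subjects (the traditional group curve)."""
    return [statistics.fmean(column) for column in zip(*curves)]


def increments(curve):
    """Gain in the average from each trial to the next."""
    return [b - a for a, b in zip(curve, curve[1:])]


rng = random.Random(0)
N_TRIALS, N_SUBJECTS = 40, 200

# Symmetric distribution of onsets (roughly normal, clipped to the trial range).
symmetric_onsets = [
    min(N_TRIALS - 1, max(0, round(rng.gauss(20, 4)))) for _ in range(N_SUBJECTS)
]
# Positively skewed distribution of onsets (exponential, mean of about 6 trials).
skewed_onsets = [
    min(N_TRIALS - 1, int(rng.expovariate(1 / 6))) for _ in range(N_SUBJECTS)
]

sym_avg = average_curve([step_curve(o, N_TRIALS) for o in symmetric_onsets])
skew_avg = average_curve([step_curve(o, N_TRIALS) for o in skewed_onsets])

sym_gain = increments(sym_avg)    # largest gains near the middle: sigmoid average
skew_gain = increments(skew_avg)  # largest gains early: negatively accelerated average
```

Every individual curve here takes only two values, yet both averages rise gradually; the shape of the rise reflects where the onsets fall, not how any one subject learned.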

The only curves which bear directly 
on basic theoretical problems are those 
which represent individual instances of 
learning—single associations acquired 
by individual subjects. However, it is 
difficult to obtain individual curves for 
the simpler learning situations. Com- 
plex tasks such as motor skills, long 
mazes, or difficult syllable lists may 
give reliable individual curves, but they 
are not directly applicable to the more 
basic problems. Puzzle boxes, dis- 
criminations, and conditioned responses 
are learned so quickly that we can 
rarely get as many as ten reliable sam- 
ples of performance during the course 
of training. Group curves are acceptable 
substitutes only when we can assume 
that they are representative of their 
component individual curves, and this 
assumption is seldom justified. 

Failure of an average curve to represent
individual learning is due to individual
differences among the subjects.
If the only differences are in speed of
learning, the general form of the learning
function can be approximated reasonably
well by a Vincent curve. However,
we can seldom justify the assumption
that individual curves do not also
differ in form. Even if homogeneity
of form exists, the irregularity of individual
curves makes it difficult to demonstrate.
In some cases, form may be
systematically related to learning speed,
and Hilgard (2, p. 293) has suggested
that separate Vincent curves should
then be drawn for fast, slow, and perhaps
intermediate learning, which is
practical only if an adequate number of
subjects is available in each of the
necessary speed categories.

Where generally heterogeneous groups
are involved, it is apparently impossible
to plot satisfactory average curves
of the complete course of learning.
However, certain theoretical issues can
be dealt with by examining smaller segments
of the learning process, and a
method is available for plotting average
curves of such segments.² The general
features of this method may be illustrated
by analysis of some data made
available by Mr. Robert Thompson.³
Forty rats were trained in a brightness
discrimination problem, to a criterion
of ten consecutive correct responses.
Motivation was escape from water, and
correction was allowed.


Fig. 1. The effect of distribution on the forms of average learning curves. Light lines show
hypothetical individual curves; heavy lines show the resulting average curves. Broken vertical
lines indicate discontinuity.


2 A procedure very similar to the one described
here has been used by Shuttleworth
(4) in his study of the prepubertal spurt in
human growth.

The traditional average curve for this
experiment is shown in Fig. 2, where
each point represents 400 choices: ten
trials for each of 40 rats.⁴ The light
lines in the same figure are individual
learning curves for the 1st, 10th, 20th,
30th, and 40th animals to reach criterion.
When due allowance has been
made for the irregularity of the individual
curves, the impression remains
that the average curve has grossly distorted
the individual data. In contrast
to the gradual, negatively accelerated,
group curve, the individual curves suggest
a variable period of no progress,
followed by uniformly sudden (or
“insightful”) learning.

The form of these individual curves
raises the problem of how to plot an
average curve which is capable of showing
such an abrupt rise to mastery.
The proposed solution is shown graphically
in Fig. 3. Here the individual
curves are displaced horizontally, so
that their final points coincide. It is
now possible to draw a single curve
which does no serious injustice to any
of the individuals. Such a curve will
not tell us the average accuracy of
performance on the xth trial, but it will
indicate the average accuracy x trials
before achieving the criterion.


Fig. 2. Brightness discrimination learning in rats. The heavy line is the average learning
curve (N = 40); light lines are individual curves for the 1st, 10th, 20th, 30th, and 40th rats to
reach criterion.


3 Formerly with the Yerkes Laboratories,
now at the University of Texas.

4 Means are based on the assumption that
no more errors would be made after achievement
of criterion. The “true” curve would be
slightly lower, except near the origin.

The actual construction of this curve 
involves plotting percentages for suc- 
cessive trials, not working forward from 
Trial 1, but rather working backward 
from the criterion. The heavy line in 
Fig. 4 is such a backward learning 
curve, each point representing the per- 
centage correct on a single trial, for 40 
rats. 
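The construction just described can be sketched in a few lines. This is a minimal illustration, not the author's procedure in detail: it assumes each subject's record is a list of 0/1 scores (error/correct), takes the criterion to be the first run of `run_length` consecutive correct responses, and aligns subjects on the first trial of that run. The function names and the three-subject data set are hypothetical.

```python
def criterion_start(responses, run_length):
    """Index of the first trial of the first run of `run_length`
    consecutive correct responses, or None if the run never occurs."""
    streak = 0
    for i, correct in enumerate(responses):
        streak = streak + 1 if correct else 0
        if streak == run_length:
            return i - run_length + 1
    return None


def backward_curve(subjects, run_length):
    """Percentage correct on each precriterial trial, counted backward
    from the criterion run (-1 is the trial just before the run).
    The curve stops where the fastest learner runs out of trials."""
    starts = [criterion_start(s, run_length) for s in subjects]
    if any(start is None for start in starts):
        raise ValueError("every subject must reach the criterion")
    depth = min(starts)  # precriterial trials available for all subjects
    curve = {}
    for back in range(1, depth + 1):
        scores = [s[start - back] for s, start in zip(subjects, starts)]
        curve[-back] = 100.0 * sum(scores) / len(scores)
    return curve


# Hypothetical records: 1 = correct, 0 = error; criterion of 3 in a row.
records = [
    [0, 1, 0, 0, 1, 0, 1, 1, 1],
    [1, 0, 1, 1, 1],
    [0, 1, 0, 0, 1, 1, 1],
]
curve = backward_curve(records, run_length=3)
for position in sorted(curve):
    print(position, round(curve[position], 1))
```

Note that position -1 is an error for every subject by construction, since a correct response there would have started the criterion run one trial earlier; such a point says nothing about learning.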

Backward curves, like Vincent curves,
should not be drawn through points
based on the criterial trials or the
immediately preceding trial (2, p. 286).⁵
Because of this restriction, it is desirable
to have one or more postcriterial
trials, to represent the final stage of
learning. A postcriterial point was provided
in this case by reanalyzing the
data in terms of a criterion of nine
successive correct trials, instead of the ten
originally used in training. (If the rats
had been trained with this analysis in
mind, they would have been run for
several trials beyond criterion, so that
postcriterial slope, as well as level, could
have been determined.)


Fig. 3. The individual curves of Fig. 2, shifted horizontally


Fig. 4. Comparison of the traditional average curve with the backward curve. The horizontal
position of the backward curve was arbitrarily chosen to make the two curves coincide at
the 95% level.


Fig. 5. Average and backward curves for 23 cases of visual discrimination learning by monkeys.
Points are based on successive groups of 10 trials. From 90 to 392 trials were required
to reach a criterion of 20 successive correct choices. (Data supplied by K. L. Chow.)


5 Attainment of criterion is due primarily to
the subject’s increased ability, but it is also
due in part to chance fluctuation, which the
criterion catches on the upswing. If the criterion
is not stringent enough, chance may contribute
more than ability. Grant (1) has prepared
tables from which appropriate criteria
may be selected for discrimination training.

If criterial trials are admitted in a backward
curve, “insightful learning” can be demonstrated
in a group of tossed coins. This
may take time if a stringent criterion is used,
but most coins will eventually choose heads
ten times in a row if given enough training.
Coins differ from other species in showing no
retention of the habit on postcriterial trials.
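The coin-tossing argument in footnote 5 is easy to reproduce by simulation. The sketch below is illustrative only: the number of coins, the number of postcriterial tosses, and the random seed are arbitrary assumptions. Each simulated fair coin is "trained" until it shows ten heads in a row, and performance is then summarized for the criterial tosses, the toss just before them, all earlier tosses, and a block of postcriterial tosses.

```python
import random


def toss_until_criterion(rng, run_length, n_post):
    """Toss a fair coin until `run_length` heads occur in a row,
    then toss `n_post` more times. Returns (training, postcriterial)."""
    training, streak = [], 0
    while streak < run_length:
        head = rng.random() < 0.5
        training.append(head)
        streak = streak + 1 if head else 0
    post = [rng.random() < 0.5 for _ in range(n_post)]
    return training, post


def percent(values):
    return 100.0 * sum(values) / len(values)


rng = random.Random(1)
RUN, N_COINS, N_POST = 10, 300, 20
records = [toss_until_criterion(rng, RUN, N_POST) for _ in range(N_COINS)]

# The criterial tosses themselves: heads by definition of the criterion.
criterial = percent([t for training, _ in records for t in training[-RUN:]])
# The toss just before the criterion run: tails by construction.
preceding = [training[-RUN - 1] for training, _ in records if len(training) > RUN]
# Everything earlier than that, pooled over coins.
earlier = percent([t for training, _ in records for t in training[:-RUN - 1]])
# Tosses made after the criterion was reached.
postcriterial = percent([t for _, post in records for t in post])

print(f"earlier tosses      : {earlier:.1f}% heads")
print(f"criterial tosses    : {criterial:.1f}% heads")
print(f"postcriterial tosses: {postcriterial:.1f}% heads")
```

A backward curve drawn through the criterial tosses would show an abrupt jump from about chance to perfect performance, which is entirely an artifact of the selection rule; the postcriterial tosses return to chance, as the footnote observes.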
Since some rats learned the problem
very quickly, the curve can be carried 
backward only six trials from the cri- 
terion; before this point, progressively 
fewer rats would be included. Since 
our purpose is to examine only a small 
segment of learning, this is not a seri- 
ous disadvantage. If the curve could 
be carried further, the phase differences 
among individual subjects would be- 
come progressively larger, and the 
curve less valid—just as the forward 
curve becomes less representative as it 
gets farther from the first trial. 

This backward curve is intended to
show just one thing: the course of
learning in the immediate region of the
criterion. Figure 4 indicates that in
this experiment performance averaged
little better than chance a few trials
before the criterion, and that it rose
through 80% on the second precriterial
trial to 95% just after criterion. The
individual curves are too coarse and
irregular to give a clear picture of this
sudden rise, and the traditional average
curve does not show it at all. A
Vincent curve for these data would be
almost flat, ending below 70%. (A
modified Vincent curve, including
postcriterial data, would be more
appropriate.)

Group data are commonly so hetero- 
geneous that the customary type of 
average curve is bound to give an im- 
pression of gradual learning, regardless 
of the actual facts. The question may 
arise as to whether the backward curve 
is foredoomed to give an impression of 
sudden learning. The answer is illus- 
trated in Fig. 5, which shows forward 
and backward curves of equal slope, 
both based on the same data. The 
backward curve does demonstrate grad- 
ual learning, when it occurs. 

Discrimination learning has been 
used in these examples only because 
discrimination data happened to be 
available. The general method is
applicable to other learning situations as
well and could, in fact, be used more 
effectively in some other cases. In dis- 
crimination work the criterion must in- 
clude a number of trials which, since 
they are not used in plotting, leave a 
large gap in the curve, between pre- 
and postcriterial points. This gap can 
be reduced to one trial in conditioning 
or puzzle-box learning, where the am- 
plitude or time score for a single trial 
may be used as a criterion. 

Although examples have been given 
of only one application of this method, 
it has more general possibilities. In 
addition to detecting sudden solution, 
it may be used to demonstrate other 
features (such as plateaus, peaks, or 
spurts) which would be obscured in the 
customary over-all group curves. The 
criteria used in defining the segments 
to be plotted would not be criteria of 
mastery, in such cases, and practical 
difficulties might be encountered in de- 
vising criteria which would be reason- 
ably short, yet adequately reliable. 

Special analysis was not entirely nec- 
essary in the examples given, since 
many of the subjects learned so slowly 
that the sudden (or gradual) nature of 
their final solutions was reasonably ap- 
parent in the individual curves. The 
advantage of the method would be 
greater in cases of faster learning, 
where usable individual curves cannot 
be obtained. It has some utility even 
with slow learning, however, since large 
numbers of individual curves are cum- 
bersome to deal with, and impractical 
to publish. 

Speculation as to why the rats of 
Fig. 4 learned so much more suddenly 
than the monkeys of Fig. 5 would be 
hazardous, since the two experiments 
differed in many ways. It may even- 
tually be profitable to look for the cause 
of such differences, but only when more 
cases of sudden learning are available. 
It is hoped that the procedure suggested
here may make some such cases
manifest, which might otherwise remain 
latent in the form of inadequately ana- 
lyzed data. 


SUMMARY 


A method is proposed for plotting 
small segments of group learning curves 
in such a way that details are shown 
which would be obscured in the custo- 
mary types of average curves. 

The method is illustrated by search 
for sudden solution in two batches of 
discrimination data. Sudden learning
was found in one case, and not in the
other. 

Suggestions are offered for other ap- 
plications of the method. 


A THEORY OF STIMULUS VARIABILITY IN LEARNING¹


W. K. ESTES AND C. J. BURKE 


Indiana University 


There are a number of aspects of the 
stimulating situation in learning experi- 
ments that are recognized as important 
by theorists of otherwise diverse view- 
points but which require explicit rep- 
resentation in a formal model for ef- 
fective utilization. One may find, for 
example, in the writings of Skinner, 
Hull, and Guthrie clear recognition of 
the statistical character of the stimulus 
concept. All conceive a stimulating 
situation as made up of many compo- 
nents which vary more or less inde- 
pendently. From this locus of agree- 
ment, strategies diverge. Skinner (17) 
incorporates the notion of variability 
into his stimulus-class concept, but 
makes little use of it in treating data. 
Hull states the concept of multiple components
explicitly (13) but proceeds to
write postulates concerning the conditions
of learning in terms of single
components, leaving a gap between the
formal theory and experimentally defined
variables. Guthrie (11) gives
verbal interpretations of various phe- 
nomena, e.g., effects of repetition, in 
terms of stimulus variability; these in- 
terpretations generally appear plausi- 
ble but they have not gained wide ac- 
ceptance among investigators of learn- 
ing, possibly because Guthrie’s assump- 
tions have not been formalized in a 
way that would make them easily used
by others. Statistical theories of learning
differ from Hull in making stimulus
variability a central concept to be used
for explanatory purposes rather than
treating it as a source of error, and they
go beyond Skinner and Guthrie in attempting
to construct a formalism that
will permit unambiguous statements of
assumptions about stimulus variables
and rigorous derivation of the consequences
of these assumptions.


1 This paper is based upon a paper reported
by the writers at the Boston meetings of the
Institute of Mathematical Statistics in December
1951. The writers’ thinking along these
and related lines has been stimulated and their
research has been facilitated by participation
in an interuniversity seminar in mathematical
models for behavior theory which met at
Tufts College during the summer of 1951 and
was sponsored by SSRC.

It has been shown in a previous 
paper (7) that several quantitative as- 
pects of learning, for example the ex- 
ponential curve of habit growth regu- 
larly obtained in certain conditioning 
experiments, follow as consequences of 
statistical assumptions and need not be 
accounted for by independent postu- 
lates. All of the derivations were car- 
ried out, however, under the simplifying 
assumption that all components of a 
stimulating situation are equally likely 
to occur on any trial. By removing 
that restriction, we are now in a posi- 
tion to generalize and extend the theory 
in several respects. It will be possible 
to show that regardless of whether as- 
sumptions as to the necessary condi- 
tions for learning are drawn from con- 
tiguity theories or from reinforcement 
theories, certain characteristics of the 
learning process are invariant with re- 
spect to stimulus properties while other 
characteristics depend in specific ways 
upon the nature of the stimulating 
situation. 


THE GENERALIZED SET MODEL:
ASSUMPTIONS AND NOTATION 


The exposure of an organism to a 
stimulating situation determines a set 
of events referred to collectively as
stimulation. These events constitute 
the data of the various special disci- 
plines concerned with vision, audition, 
etc. We wish to formulate our model 
of the stimulus situation so that infor- 
mation from these special disciplines 
can be fed into the theory, although 
utilization of that information will de- 
pend upon the demands of learning ex- 
periments. 

For the present we shall make only 
the following very general assumptions 
about the stimulating situation: (a)
The effect of a stimulus situation upon
an organism may be regarded as made
up of many component events. (b)
When a situation is repeated on a series
of trials, any one of these component
stimulus events may occur on some
trials and fail to occur on others; as a
first approximation, at least, the relative
frequencies of the various stimulus
events when the same situation (as defined
experimentally) occurs on a series
of trials, may be represented by independent
probabilities. We formulate
these assumptions conceptually as fol- 
lows: 

(a) With any given organism we associate
a set S* of N* elements.² The
N* elements of S* are to represent all
of the stimulus events that can occur
in that organism in any situation whatever
with each of these possible events
corresponding to an element of the set.
(b) For any reproducible stimulating
situation we assume a distribution of
values of the parameter θ; we represent
by θ_i the probability that the stimulus
event corresponding to the i-th element
of S* occurs on any given trial.


2 In the sequel, various sets will be designated
by the letter S, accompanied by appropriate
subscripts and superscripts. The letter
N, with the same arrangement of subscripts 
and superscripts, always denotes the size of 
the set. 
It is assumed that any change in the
situation (and we shall attempt to deal
only with controlled changes corresponding
to manipulations of experimental
variables) determines a new distribution
of values of the θ_i. By repeating
the “same” situation, we mean the same 
as described in physical terms, and we 
recognize that, strictly speaking, repeti- 
tion of the same situation refers to an 
idealized state of affairs which can be 
approached by increasing experimental 
control but possibly never completely 
realized. 

It is recognized that some sources of 
stimulation are internal to the organ- 
ism. This means that in order to have 
a reproducible situation in a learning 
experiment it is necessary to control
the maintenance schedule of the or- 
ganism and also activities immediately 
preceding the trial. In the present 
paper we shall not use the term “trial” 
in a sufficiently extended sense to necessitate
including in the θ distribution
movement-produced-stimulation arising 
from the responses occurring on the 
trial. 

We have noted that the behavior on 
a given trial is assumed to be a function 
of the stimulus elements which are 
sampled on that trial. If in a given 
situation certain elements of S* have a
probability θ = 0 of being sampled,
those elements have a negligible effect
upon the behavior in that situation.
For this reason we often represent a
specific situation by means of a reduced
set S. An element of S* is in S
if and only if it has a non-zero value of
θ in the given situation. These sets
are represented in Fig. 1. In this connection,
we must note that a probability
of zero for a given event does
not mean that the event can never
occur “accidentally”; this probability
has the weaker meaning that the relative
frequency of occurrence of the
event is zero in the long run. For a
more detailed explication of this point
the reader is referred to Cramér (5).

FIG. 1. A schematic representation of stimulus elements, the stimulus space S*, the reduced
set S containing elements with non-zero θ values for a given stimulating situation, and the
response classes A and Ā. The arrows joining elements of S to the response classes represent the
partition of S into S_A and S_Ā.

It should be clearly understood that 
the probability, θ, that a given stimulus
event occurs on a trial may depend 
upon many different environmental 
events. For example, a stimulus event 
associated with visual stimulation may 
depend for its probability upon several 
different light sources in the environ- 
ment. Suppose that for a given stimu- 
lus element, the associated probability 
θ in a given situation depends only
upon two separately manipulable components
of the environment, a and b,
and that the probabilities of the element’s
being drawn if only a or b alone
were present are θ_a and θ_b, respectively.
Then the probability attached to this
element in the situation with both components
present will be


$\theta = \theta_a + \theta_b - \theta_a \theta_b.$
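
The combination rule above is the probability that at least one of two independently acting sources produces the stimulus event. A minimal Python sketch, assuming independence as the formula implies, is given here; the function name `combined_theta`, the numerical values, and the extension to more than two sources are illustrative assumptions rather than part of the theory as stated.

```python
from math import prod

def combined_theta(thetas):
    """Probability that an element is drawn when several independent
    sources are present together: 1 - product of (1 - theta_k)."""
    return 1.0 - prod(1.0 - t for t in thetas)

theta_a, theta_b = 0.3, 0.1  # assumed values for the two components

# The two-source case agrees with theta_a + theta_b - theta_a * theta_b.
print(combined_theta([theta_a, theta_b]))
print(theta_a + theta_b - theta_a * theta_b)
```

Writing the rule as one minus a product makes it plain that the combined value can never exceed 1 and is never smaller than either component taken alone.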


THE RESPONSE MODEL


The response model formulated in a 
previous paper (7) will be used here 
without any important modification. 
We shall deal only with the simple case 
of two mutually exclusive and exhaus- 
tive response classes. The response 
class being recorded in a given situa- 
tion will be designated A and the complementary
class, Ā. The dependent
variable of the theory is the probability 
that the response occurring on a given 
trial is a member of class A. It is rec- 
ognized that in a learning experiment 
the behaviors available to the organism 
may be classified in many different 
ways, depending upon the interests of 
the experimenter. The response class 
selected for investigation may be any- 
thing from the simplest reflex to a com- 
plex chain of behaviors involving many 
different groups of effectors. Adequate 
treatment of all levels of response
specification would require the formula- 
tion of a model for the response space 
and will not be attempted in the pres- 
ent paper. Preliminary investigation 
of this problem leads us to believe that 
when a response model is elaborated, 
the theory developed in this paper will 
be found to hold rigorously for the most 
elementary response components and to 
a first approximation for simple re- 
sponse classes that do not involve 
chaining. For experimental verifica- 
tion of the present theory we shall 
look to experiments involving response 
classes no more complex than flexing a 
limb, depressing a bar, or moving a key. 


CONDITIONAL RELATIONS AND 
RESPONSE PROBABILITY 


We assume that the behavior of an 
organism on any trial is a function, not 
of the entire population of possible 
stimulus events, but only of those 
stimulus events which occur on that 
trial; further, when learning takes 
place, it involves a change in the de- 
pendency of the response upon the 
stimulus events which have occurred on 
the given trial. 

Conditional relations, or for brevity, 
connections, between response classes 
and stimulus elements are defined as in 
other papers on statistical learning 
theory (3, 7). The response classes 
A and Ā define a partition of S* into
two subsets S_A* and S_Ā*. Elements
in S_A* are said to be “connected to”
or “conditioned to” response A; those
in S_Ā* to response Ā. The concept of
a partition implies specifically that
every element of S* must be connected
either to A or to Ā but that no element
may be connected to both simultaneously.³
Various features of the model
are illustrated in Fig. 1.


3 The argument of this section could as well
be given in terms of the set S as of S*, defining
S_A and S_Ā as the partition of S imposed
by the response classes A and Ā.

For each element in S* we define a
quantity F_i(n) representing the probability
that the element in question is
connected to response A, i.e., is in the
subset S_A*, at the end of trial n. The
mean value of F_i(n) over S* is, then,
simply the expected proportion of elements
connected to A, and if all of the
θ_i were equal, it would be natural to
define this proportion as the probability,
p(n), that response A occurs on
trial n + 1. In the general case, however,
not all of the θ_i are equal and the
contribution of each element should be
weighted by its probability of occurrence,
giving


(1)  $p(n) = \dfrac{\sum_i \theta_i F_i(n)}{\sum_i \theta_i} = \dfrac{1}{N\bar{\theta}} \sum_i \theta_i F_i(n).$


It will be seen that in the equal θ
case, expression (1) reduces to


(2)  $p(n) = \dfrac{1}{N} \sum_i F_i(n) = E(F_i(n))$

which, except for changes in notation, is
the definition used in previous papers
(6, 7).

The quantity p is, then, another of
the principal constructs of the theory.
It is referred to as a probability, firstly
because we assume explicitly that quantities
p are to be manipulated mathematically
in accordance with the axioms
of probability theory, and secondly because
in some situations p can be given
a frequency interpretation. In any
situation where a sequence of responses
can be obtained under conditions of
negligible learning and independent
trials (as at the asymptote of a simple
learning experiment carried out with discrete,
well-spaced trials) the numerical
value of p is taken as the average relative
frequency of response A. For all
situations the construct p is assumed to
correspond to a parameter of the behavior
system, and we do not cease to
speak of this as a probability in the 
case of a situation where it cannot be 
evaluated as a relative frequency. It 
has been shown in a previous paper (7) 
that p can be related in a simple man- 
ner to rate or latency of responding in 
many situations; thus in all applica- 
tions of the theory, p is evaluated in 
accordance with the rules prescribed by 
the theory, either from frequency data 
or from other appropriate data, and 
once evaluated is treated for all mathe- 
matical purposes as a probability. 
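
The weighted definition of p in expression (1), and its reduction to a simple mean in the equal θ case of expression (2), can be checked with a short Python sketch. The function name `response_probability` and the sample values are assumptions made for illustration only.

```python
def response_probability(thetas, F):
    """Expression (1): each element's F_i is weighted by its sampling
    probability theta_i, and the weights are normalised by their sum."""
    total = sum(thetas)
    if total == 0:
        raise ValueError("at least one element must have a non-zero theta")
    return sum(t * f for t, f in zip(thetas, F)) / total

F = [1.0, 0.0, 1.0, 0.0]  # assumed connection probabilities F_i(n)

# Equal theta values: the weighted form is just the mean of the F_i.
print(response_probability([0.2, 0.2, 0.2, 0.2], F))

# Unequal theta values: elements sampled more often count for more.
print(response_probability([0.3, 0.1, 0.3, 0.1], F))
```

Elements with θ = 0 contribute nothing to either sum, which is why the computation gives the same result whether it is carried out over S* or over the reduced set S.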


REPRESENTATION OF LEARNING 
PROCESSES 


In order to account for the gradual 
course of learning in most situations, 
a number of the earlier quantitative 
theories, e.g., those of Hull (13), Gulliksen
and Wolfle (10), and Thurstone
(18), have assumed that individual connections
are formed gradually over a
series of learning trials. Once we adopt 
a statistical view of the stimulating 
situation, however, it can be shown 
rigorously that not only the gradual 
course of learning but the form of the 
typical learning curve can be accounted 
for in terms of probability considerations
even if we assume that connections
are formed on an all-or-none
basis. This being the case, there seems 
to be no evidence whatsoever that 
would require a postulate of gradual 
formation of individual connections. 
Psychologically an all-or-none assump- 
tion has the advantage of enabling us 
to account readily for the fact that 
learning is sudden in some situations 
and gradual in others; mathematically, 
it has the advantage of great simplicity. 
For these reasons, recent statistical 
theories of learning have adopted some 
form of the all-or-none assumption (3, 
7, 15). 


Under an all-or-none theory, we must 
specify the probabilities that any stimulus
element that is sampled on a given
trial will become connected to A or to
Ā. For convenience in exposition, we
shall limit ourselves in this paper to 
the simplest special case, i.e., a homoge- 
neous series of discrete trials with 
probability equal to one that all ele- 
ments occurring on a trial become con- 
nected to response A. 

We begin by asking what can be said 
about the course of learning during a 
sequence of trials regardless of the dis- 
tribution of stimulus events. It will 
be shown that our general assumptions 
define a family of mathematical opera- 
tors describing learning during any pre- 
scribed sequence of trials, the member 
of the family applicable in a given situation
depending upon the θ distribution.
We shall first inquire into the charac- 
teristics common to all members of a 
family, and then into the conditions 
under which the operators can be ap- 
proximated adequately by the relatively 
simple functions that have been found 
convenient for representing learning 
data in previous work. 

Let us consider the course of learn- 
ing during a sequence of trials in the 
simplified situation. Each trial in the 
series is to begin with the presentation 
of a certain stimulus complex. This 
situation defines a distribution of θ over
S* so that each element in S* has some
probability, θ_i, of occurring on any trial,
and we represent by S the subset of
elements with non-zero θ values; any
element that occurs on a trial becomes 
connected to A (or remains connected 
to A if it has been drawn on a previous 
trial). For concreteness the reader 
might think of a simple conditioning 
experiment with the CS preceding the 
US by an optimal interval, and with 
conditions arranged so that the UR is 
evoked on each trial and decremental 
factors are negligible; the situation represented
by S is that obtaining from the
onset of the CS to the onset of the US,
and the response probability p will refer
to the probability of A in this situation.
The number of elements in S will
be designated by N. For simplicity we 
shall suppose in the following deriva- 
tions that none of the elements in S are 
connected to A at the beginning of the 
experiment. This means that the learning
curves obtained all begin with N_A
and p equal to zero. No loss of generality
is involved in this simplification;
our results may easily be extended
to the case of any arbitrary initial 
condition. 

The i-th element in S will still remain
in S_Ā after the n-th trial if and only if
it is not sampled on any of the first n
trials; the likelihood that this occurs
is (1 − θ_i)^n. Hence, if F_i(n) represents
the expected probability that this
element is connected to A after the n-th
trial, we obtain:


(3)  $F_i(n) = 1 - (1 - \theta_i)^n.$


The expected number of elements in S
connected to A after the n-th trial,
E[N_A(n)], will be the sum of these expected
contributions from individual
elements:


(4)  $E[N_A(n)] = \sum_i F_i(n) = \sum_i \left[1 - (1 - \theta_i)^n\right] = N - \sum_i (1 - \theta_i)^n.$
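
Equations (3) and (4) can be checked against a direct simulation of the sampling process. The Python sketch below assumes, as in the simplified situation described above, that every element starts unconnected, is sampled independently on each trial with its own probability, and becomes connected to A the first time it is drawn. The function names, the θ values, and the number of simulated runs are illustrative assumptions.

```python
import random

def expected_connected(thetas, n):
    """Equation (4): expected number of elements connected to A after n trials."""
    return sum(1.0 - (1.0 - t) ** n for t in thetas)

def simulate_connected(thetas, n, runs=20000, seed=0):
    """Monte Carlo estimate of the same quantity by sampling each element
    independently on every trial."""
    rng = random.Random(seed)
    total = 0
    for _run in range(runs):
        connected = [False] * len(thetas)
        for _trial in range(n):
            for i, t in enumerate(thetas):
                if rng.random() < t:
                    connected[i] = True  # all-or-none connection to A
        total += sum(connected)
    return total / runs

thetas = [0.3, 0.3, 0.1, 0.1]  # assumed sampling probabilities
for n in (1, 3, 5):
    print(n, round(expected_connected(thetas, n), 3), round(simulate_connected(thetas, n), 3))
```

The simulated averages should agree with equation (4) to within sampling error, which shrinks as the number of runs is increased.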


We are now in a position to express 
p, the probability of response A, as a 
function of the number of trials in this 
situation. By substituting for the term 
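F_i(n) of equation (1) its equivalent
from equation (3), we obtain the relation


(5)  $p(n) = \dfrac{1}{N\bar{\theta}} \sum_i \theta_i \left[1 - (1 - \theta_i)^n\right] = 1 - \dfrac{1}{N\bar{\theta}} \sum_i \theta_i (1 - \theta_i)^n.$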

Equation (5) defines a family of 
learning curves, one for each possible 
θ distribution, and it has a number of
simple properties that are independent
of the distribution of the θ_i. It can
easily be verified by substitution that
there is a fixed point at p = 1, and this
will be the asymptote approached by
the curve of p(n) vs. n as n increases
over all bounds. Members of the
family will be monotonically increasing,
negatively accelerated curves, approaching
a simple negative growth function
as the θ_i tend toward equality. If all
of the θ_i are equal to θ, equation (5)
reduces to


(6)  $p(n) = 1 - (1 - \theta)^n$


which, except for a change in notation, 
is the same function derived previously 
(7) for the equal θ case⁴ and corresponds
to the linear operator used by
Bush and Mosteller (2) for situations 
where no decremental factor is in- 
volved. In mathematical form, equa- 
tion (6) is the same as Hull’s well- 
known expression for growth of habit 
strength, but the function does not 
have the same relation to observed 
probability of responding in Hull’s 
theory as in the present formulation. 

Except where the distribution function
of the θ_i either is known, or can
be assumed on theoretical grounds to 
be approximated by some simple ex- 
pression, equation (5) will not be con- 
venient to work with. In practice we 
are apt to assume equal θ_i, and utilize
equation (6) to describe experimental 
data. The nature of the error of ap- 
proximation involved in doing this can 
be stated generally. Immediately after 
the first trial, the curve for the general 
case must lie above the curve for the
equal θ case; the difference between the
two curves increases for a few trials,
then decreases until they cross (in constructing
hypothetical θ distributions of
diverse forms we have usually found
this crossing in the neighborhood of
the fourth to eighth trial); after crossing,
the curves diverge to a smaller extent
than before, then come together as
both go to the same asymptote at
p = 1. It can be proved that the
curves for the general and special case
cross exactly once as n goes from one
to infinity. We cannot make any general
statement about the maximum error
involved in approximating expression
(5) with expression (6), but after
studying a number of special cases, we
are inclined to believe that the error
introduced by the approximation will
be too small to be readily detectable
experimentally for most simple learning
situations that do not involve compounding
of stimuli.


4 This is essentially the same function developed
for the equal θ case in a previous
paper (7); the terms θ and n of equation (6)
correspond to the terms q = s/S, and T of
that paper.

FIG. 2. Response probability, in S, as a function of number of trials for the numerical example.
The solid curve is the exact solution for a population of elements.
The dashed curve describes the equal θ approximation
with θ̄ = 0.2. Initially no elements of S are conditioned to A.

The development of equations (5) 
and (6) has necessarily been given in 
rather general terms, and it may be 
helpful to illustrate some of the considerations
involved by means of a simple
numerical example. Imagine that
we are dealing with some particular
conditioning experiment in which the
CS can be represented by a set S, composed
of two subsets of stimulus elements,
S_1 and S_2, of the sizes N_1 = N_2
= N/2, where N is the number of elements
in S. Assume that for all elements
in S_1 the probability of being
drawn on any trial is θ_1 = 0.3 and for
those in S_2, θ_2 = 0.1. Now we wish to
compute the predicted learning curve
during a series of trials on which A responses
are reinforced, assuming that
we begin with all elements connected to
Ā. Equation (5) becomes


$p(n) = 1 - \dfrac{1}{N(0.2)} \left[ N_1(0.3)(1 - 0.3)^n + N_2(0.1)(1 - 0.1)^n \right]$

$\phantom{p(n)} = 1 - \dfrac{1}{0.4} \left[ 0.3(0.7)^n + 0.1(0.9)^n \right].$


Plotting numerical values computed 
from this equation, we obtain the solid 
curve given in Fig. 2. 

Now let us approach the same problem,
but supposing this time that we
know nothing about the different θ
values in the subsets S_1 and S_2 and are
given only that θ̄ = 0.2. We now obtain
predicted learning curves under
the equal θ approximation. Equation
(6) becomes:


$p(n) = 1 - (1 - 0.2)^n$


and numerical values computed from 
this yield the dashed curve of Fig. 2. 

Inspection of Fig. 2 shows that the 
exact treatment leads to higher values 
of p(n) on the early trials but to lower
values on the later trials, the difference
becoming negligible for large n. The
reason, in brief, for the steeper curvature
of the exact curve is that elements
with high θ values are likely to be
drawn, and therefore conditioned to A,
earlier in the learning process than elements
with low θ values, and then, because
they will tend to recur frequently
in successive samples, to lead to relatively
high values of p. During the
late stages of learning, elements with
low θ values that have not been drawn
on the early trials will contribute more
unconnected elements per trial than
would be appearing at the same stage
with an equal θ distribution and will
depress the value of p below the curve
for the equal θ approximation.
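
The two curves of the numerical example can be tabulated directly. The Python sketch below evaluates equation (5) for a population described by (θ, share) pairs, where "share" is the fraction of the elements of S having that θ, and compares it with equation (6) evaluated at the mean θ. The function names and the representation by shares are conveniences assumed for this illustration; the θ values 0.3 and 0.1 and the equal subset sizes are those of the example.

```python
def exact_curve(groups, n):
    """Equation (5) for a population given as (theta, share) pairs."""
    mean_theta = sum(share * theta for theta, share in groups)
    residual = sum(share * theta * (1 - theta) ** n for theta, share in groups)
    return 1 - residual / mean_theta

def equal_theta_curve(theta, n):
    """Equation (6): the equal theta approximation."""
    return 1 - (1 - theta) ** n

def first_crossing(groups, max_n=100):
    """First trial on which the exact curve falls below the approximation."""
    mean_theta = sum(share * theta for theta, share in groups)
    for n in range(1, max_n + 1):
        if exact_curve(groups, n) < equal_theta_curve(mean_theta, n):
            return n
    return None

groups = [(0.3, 0.5), (0.1, 0.5)]  # half the elements at 0.3, half at 0.1
for n in range(0, 13):
    print(n, round(exact_curve(groups, n), 3), round(equal_theta_curve(0.2, n), 3))
print("first trial with exact < approximation:", first_crossing(groups))
```

Computed this way, the exact curve for these particular values lies above the approximation through the ninth trial and falls just below it on the tenth, after which both approach p = 1.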

It should be emphasized that the 
generality of the present approach to 
learning theory lies in the concepts in- 
troduced and the methods developed 
for operating with them, not in the 
particular equations derived. Equa- 
tion (5), for example, can be expected 
to apply only to an extremely narrow 
class of learning experiments. On the 
other hand, the methods utilized in de- 
riving equation (5) are applicable to a 
wide variety of situations. For the in- 
terest of the experimentally oriented 
reader, we will indicate briefly a few of 
the most obvious extensions of the 
theory developed above, limiting ourselves
to the equal θ case.

As written, equation (6) represents 
the predicted course of conditioning for 
a single organism with an initial re- 
sponse probability of zero. We can 
allow for the possibility that an experi- 
ment may begin at some value of p(0) 
other than zero by rewriting (6) in the 
more general form 


(7)  $p(n) = 1 - [1 - p(0)](1 - \theta)^n$


which has the same form as (6) ex- 
cept for the initial value. 

If we wish to consider the mean 
course of conditioning in a group of m 
organisms with like values of θ but
varying initial response probabilities,
we need simply sum equation (7) over
the group and divide by m, obtaining


(8)  $\bar{p}(n) = \dfrac{1}{m} \sum p(n) = 1 - [1 - \bar{p}(0)](1 - \theta)^n.$


The standard deviation of p(n) under
these circumstances is simply


(9)  $\sigma_p(n) = \sqrt{\dfrac{1}{m} \sum p^2(n) - \bar{p}^2(n)} = (1 - \theta)^n \sigma_p(0)$


where σ_p(0) is the dispersion of the
initial p values for the group. Variability
around the mean learning curve
decreases to zero in a simple manner 
as learning progresses. 
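
Equations (8) and (9) follow because each organism's deviation from the group mean is multiplied by (1 − θ) on every trial. A brief numerical check in Python is sketched below; the initial probabilities, the value of θ, and the function names are assumed for illustration, and the standard deviation is the population form (division by m) that appears in equation (9).

```python
from statistics import mean, pstdev

def acquisition(p0, theta, n):
    """Equation (7): response probability after n trials, starting from p0."""
    return 1 - (1 - p0) * (1 - theta) ** n

def group_stats(initial, theta, n):
    """Mean and population standard deviation of p(n) over the group."""
    ps = [acquisition(p0, theta, n) for p0 in initial]
    return mean(ps), pstdev(ps)

initial = [0.0, 0.1, 0.2, 0.5]  # hypothetical p(0) values for m = 4 organisms
theta = 0.2

for n in (0, 1, 5, 10):
    observed_mean, observed_sd = group_stats(initial, theta, n)
    predicted_mean = 1 - (1 - mean(initial)) * (1 - theta) ** n   # equation (8)
    predicted_sd = (1 - theta) ** n * pstdev(initial)             # equation (9)
    print(n, round(observed_mean, 4), round(predicted_mean, 4),
          round(observed_sd, 4), round(predicted_sd, 4))
```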

The treatment of counter-condition- 
ing, i.e., extinguishing one response by 
giving uniform reinforcement to a com- 
peting response, follows automatically 
from our account of the acquisition 
process. Returning to equation (6) 
and recalling that the probabilities of A
and Ā must always sum to unity, we
note that while response A undergoes
conditioning in accordance with (6),
response Ā must undergo extinction in
accordance with the function


$p_{\bar{A}}(n) = 1 - p_A(n) = (1 - \theta)^n.$

If, then, we begin with any arbitrary
p(0) and arrange conditions so that Ā
is evoked and conditioned to all elements
drawn on each trial, the extinction
of response A will be given by
the simple decay function


(10)  $p(n) = p(0)(1 - \theta)^n.$


Again the mean and standard deviation 
of p(n) can easily be computed for a
group of organisms with like values of
θ but varying values of p(0):


(11)  $\bar{p}(n) = \bar{p}(0)(1 - \theta)^n$
(12)  $\sigma_p(n) = (1 - \theta)^n \sigma_p(0).$


As in the case of acquisition, variability 
around the mean curve decreases to 
zero in a simple manner over a series of 
trials. 

Since variability due to variation in 
p(0) is reduced during both conditioning
and counter-conditioning, it will be
seen that in general we should expect 
less variability around a curve of re- 
learning than around a curve of original 
learning for a given group of subjects. 
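
The expectation of reduced variability at relearning can be illustrated by chaining equations (7) and (10). The Python sketch below assumes a group with a common θ, a block of conditioning trials followed by a block of counter-conditioning trials, and then compares the dispersion of p at the start of original learning with the dispersion at the start of relearning. The trial counts, initial values, and function names are hypothetical.

```python
from statistics import pstdev

def after_conditioning(p0, theta, n):
    """Equation (7): response probability after n conditioning trials."""
    return 1 - (1 - p0) * (1 - theta) ** n

def after_extinction(p0, theta, n):
    """Equation (10): response probability after n counter-conditioning trials."""
    return p0 * (1 - theta) ** n

def dispersion_at_relearning(initial, theta, n_acq, n_ext):
    """Population standard deviation of p when relearning begins."""
    learned = [after_conditioning(p, theta, n_acq) for p in initial]
    extinguished = [after_extinction(p, theta, n_ext) for p in learned]
    return pstdev(extinguished)

initial = [0.0, 0.1, 0.2, 0.5]  # hypothetical p(0) values
theta = 0.2

print("original learning:", round(pstdev(initial), 4))
print("relearning       :", round(dispersion_at_relearning(initial, theta, 5, 5), 4))
```

Under these assumptions the dispersion at the start of relearning equals the original dispersion multiplied by (1 − θ) raised to the total number of intervening trials, since equations (9) and (12) apply the same shrinking factor in both phases.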


APPLICATION OF THE STATISTICAL 
MODEL TO LEARNING
EXPERIMENTS 


Since our concern in this paper has 
been with the development of a stimu- 
lus model of considerable generality, it 
has been necessary in the interests of 
clear exposition to omit reference to 
most of the empirical material upon 
which our theoretical assumptions are 
based. The evaluation of the model 
must rest upon detailed interpretation 
of specific experimental situations. It 
is clear, however, that the statistical 
model developed here cannot be tested 
in isolation; only when it is taken to- 
gether with assumptions as to how 
learning occurs and with rules of correspondence
between terms of the
theory and experimental variables, will 
experimental evaluation be possible. 
Limitations of space preclude a detailed 
theoretical analysis of individual learn- 
ing situations in this paper. In order 
to indicate how the model will be uti- 
lized and to suggest some of its ex- 
planatory potentialities we shall con- 
clude with a few general remarks 
concerning the interpretation of learn- 
ing phenomena within the theoretical 
framework we have developed. 

Application of the model to any one 
isolated experiment will always involve 
an element of circularity, for information
about a given θ distribution must
be obtained from behavioral data.
This circularity disappears as soon as
data are available from a number of
related experiments. The utility of the
concept is expected to lie in the possibility
of predicting a variety of facts
once the parameters of the θ distribution
have been evaluated for a situation.
The methodology involved has
been illustrated on a small scale by an
experiment (6) in which the mean θ
value for an operant conditioning situation
was estimated from the acquisition
curve of a bar-pressing habit and then 
utilized in predicting the course of 
acquisition of a second bar-pressing 
habit by the same animals under 
slightly modified conditions. 
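
The estimation procedure used in that experiment is not described here. Purely as an illustration of the estimate-then-predict logic, the Python sketch below assumes that response probabilities p(n) are available directly for trials 1, 2, ... and that they follow equation (6), so that log(1 − p(n)) = n log(1 − θ); a least-squares line through the origin then gives an estimate of θ, which is carried over to equation (7) for a second situation. The function names, the fitting method, and the data are all assumptions.

```python
import math

def estimate_theta(ps):
    """Least-squares fit through the origin of log(1 - p(n)) against n,
    where ps[k] is the response probability on trial n = k + 1 (all < 1)."""
    numerator = sum(n * math.log(1 - p) for n, p in enumerate(ps, start=1))
    denominator = sum(n * n for n in range(1, len(ps) + 1))
    slope = numerator / denominator  # estimate of log(1 - theta)
    return 1 - math.exp(slope)

def predict(p0, theta, n):
    """Equation (7): predicted response probability after n trials from p0."""
    return 1 - (1 - p0) * (1 - theta) ** n

# Hypothetical acquisition data generated from equation (6) with theta = 0.2.
first_curve = [1 - 0.8 ** n for n in range(1, 9)]
theta_hat = estimate_theta(first_curve)
print("estimated theta:", round(theta_hat, 4))

# Prediction for a second habit starting from an assumed p(0) of 0.1.
print([round(predict(0.1, theta_hat, n), 3) for n in range(0, 6)])
```

With real data the observed proportions would be noisy, and values of p equal to 1 would have to be excluded or adjusted before taking logarithms.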

When the statistical model is taken 
together with an assumption of associa- 
tion by contiguity, we have the essen- 
tials of a theory of simple learning. 
The learning functions (5), (6), and 
(10) derived above should be expected 
to provide a description of the course 
of learning in certain elementary ex- 
periments in the areas of conditioning 
and verbal association. It must be em- 
phasized, however, that these functions 
alone will not constitute an adequate 
theory of conditioning, for a number of 
relevant variables, especially those controlling
response decrement, have not
been taken into account in our deriva- 
tions. In conditioning experiments 
where decremental factors are mini- 
mized, there is considerable evidence 
(1, 4, 9, 14, 16) that the curve of con- 
ditioning has the principal properties of 
our equation (5) and can be well approximated
by the equal θ case (7).
The fact that functions derived from 
the model can be fitted to certain em- 
pirical curves is a desirable outcome, of 
course, but cannot be regarded as pro- 
viding a very exacting test of the 
theory; probably any contemporary 
quantitative theory will manage to ac- 
complish this much. On the other 
hand, the fact that the properties of our 
learning functions follow from the sta- 
tistical nature of the stimulating situa- 
tion is of some interest; in this respect 
the structure of the present theory is 
simpler than certain others, e.g., that 
of Hull (13), which require an inde-
pendent postulate to account for the
form of the conditioning curve.

It should also be noted that devia- 
tions from the exponential curve form 
may be as significant as instances of
good fit. From the present model we
must predict a specific kind of deviation 
when the stimulating situation contains 
elements of widely varying θ values.
If, for example, curves of conditioning
to two stimuli taken separately yield
significantly different values of θ, then
the curve of conditioning to a com-
pound of the two stimuli should be ex-
pected to deviate further than either of
the separate curves from a simple
growth function. The only relevant ex- 
periment we have discovered in the 
literature is one reported by Miller 
(16); Miller’s results appear to be in 
line with this analysis, but we would 
hesitate to regard this aspect of the 
theory as substantiated until additional 
relevant data become available. 
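
The predicted deviation can be illustrated numerically. The following is a minimal sketch, assuming that response probability is the θ-weighted proportion of elements connected to A, that every element starts unconnected, and that each separate stimulus is internally homogeneous in θ; the function names and θ values are hypothetical. For a simple growth function the ratio (1 - p(n+1)) / (1 - p(n)) is constant, so the spread of that ratio serves as a rough index of deviation.

```python
def growth_curve(thetas, n_trials):
    """p(n) under the assumed theta-weighted form, starting from p(0) = 0."""
    total = sum(thetas)
    return [1.0 - sum(th * (1.0 - th) ** n for th in thetas) / total
            for n in range(n_trials + 1)]

def ratio_spread(curve):
    """Range of (1 - p(n+1)) / (1 - p(n)); zero for a simple growth function."""
    gaps = [1.0 - p for p in curve]
    ratios = [gaps[n + 1] / gaps[n] for n in range(len(gaps) - 1)]
    return max(ratios) - min(ratios)

stimulus_a = [0.1] * 5            # hypothetical low-theta stimulus
stimulus_b = [0.5] * 5            # hypothetical high-theta stimulus
compound = stimulus_a + stimulus_b

spread_a = ratio_spread(growth_curve(stimulus_a, 20))
spread_b = ratio_spread(growth_curve(stimulus_b, 20))
spread_compound = ratio_spread(growth_curve(compound, 20))
```

Under these assumptions the two separate curves show essentially no spread, while the compound curve starts out dominated by the high θ elements and finishes at the slower rate of the low θ elements.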

Although we shall not develop the
argument in mathematical detail in the
present paper, it may be noted that the 
statistical association theory yields cer- 
tain specific predictions concerning the 
effects of past learning upon the course 
of learning in a new situation. In gen- 
eral, the increment or decrement in p 
during any trial depends to a certain 
extent upon the immediately preceding 
sequence of trials. Suppose that we 
have two identical animals each of 
which has p(n) equal, say, to 0.5 at the
end of trial n of an experiment, and
suppose that for each animal response
A is reinforced on trial n + 1. The
histories of the two animals are pre-
sumed to differ in that the first animal
has arrived at p(n) = 0.5 via a se-
quence of reinforced trials while the
second animal has arrived at this value
via a sequence of unreinforced trials.
On trial n + 1, the second animal will
receive the greater increment to p (ex-
cept in the equal θ case); the reason is,
in brief, that for both animals the
stimulus elements most likely to occur
on trial n + 1 are those with high θ
values; for the first animal these ele-
ments will have occurred frequently
during the immediately preceding se-
quence of trials and thus will tend to
be preponderantly connected to A prior
to trial n + 1; in the case of the sec-
ond animal, the high θ elements will
have been connected to Ā during the
immediately preceding sequence and
thus when A is reinforced on trial
n + 1, the second animal will receive
the greater increment in weight of con- 
nected elements. From this analysis it 
follows that, other things equal, a curve 
of reconditioning will approach its 
asymptote more rapidly than the curve 
of original conditioning unless extinc- 
tion has actually been carried to zero. 
How important the role of the unequal 
θ distribution will prove to be in ac-
counting for empirical phenomena of
relearning cannot be adequately judged
until further research has provided
means for estimating the orders of mag- 
nitude of the effects we have mentioned 
here. 
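
The comparison of the two histories can be checked with a small computation. The sketch below rests on several assumptions that are ours for illustration: response probability is the θ-weighted proportion of elements connected to A; a reinforced trial connects each sampled element to A; an unreinforced trial disconnects each sampled element from A; and the number of trials is treated as continuous so that both animals can be brought to exactly p = 0.5. All names and θ values are hypothetical.

```python
def trials_to_gap(thetas, target_gap):
    """Bisection for the (continuous) trial count t at which
    sum(theta * (1 - theta)**t) / sum(theta) equals target_gap."""
    total = sum(thetas)
    low, high = 0.0, 1000.0
    for _ in range(200):
        mid = (low + high) / 2.0
        gap = sum(th * (1.0 - th) ** mid for th in thetas) / total
        if gap > target_gap:
            low = mid       # not enough trials yet
        else:
            high = mid
    return (low + high) / 2.0

def next_increment(thetas, connected):
    """Gain in p on one reinforced trial: sum(theta**2 * (1 - F)) / sum(theta)."""
    total = sum(thetas)
    return sum(th * th * (1.0 - f) for th, f in zip(thetas, connected)) / total

def compare_histories(thetas, p_target=0.5):
    # First animal: starts unconnected and is reinforced until p = p_target.
    t_acq = trials_to_gap(thetas, 1.0 - p_target)
    f_acq = [1.0 - (1.0 - th) ** t_acq for th in thetas]
    # Second animal: starts fully connected and is extinguished to p = p_target.
    t_ext = trials_to_gap(thetas, p_target)
    f_ext = [(1.0 - th) ** t_ext for th in thetas]
    return next_increment(thetas, f_acq), next_increment(thetas, f_ext)

inc_first, inc_second = compare_histories([0.1, 0.5])   # unequal theta
eq_first, eq_second = compare_histories([0.3, 0.3])     # equal theta
```

With unequal θ values the second animal receives the larger increment, and with equal θ values the two increments coincide, in agreement with the argument above.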


SUMMARY 


Earlier statistical treatments of sim- 
ple associative learning have been re- 
fined and generalized by analyzing the 
stimulus concept in greater detail than 
heretofore and by taking account of the 
fact that different components of a 
stimulating situation may have differ- 
ent probabilities of affecting behavior. 

The population of stimulus events 
corresponding to an independent ex- 
perimental variable is represented in 
the statistical model by a mathematical 
set. The relative frequencies with 
which various aspects of the stimulus 
variable affect behavior in a given ex- 
periment are represented by set opera- 
tions and functions. 

The statistical model, taken together 
with an assumption of association by 
contiguity, provides a limited theory 
of certain conditioning phenomena. 
Within this theory it has been possible 
to distinguish aspects of the learning 
process that depend upon properties of 
the stimulating situation from those 
that do not. Certain general predic-
tions from the theory concerning ac-
quisition, extinction, and relearning are
compared with experimental findings. 

Salient characteristics of the model 
elaborated here are compared with 
other quantitative formulations of 
learning. 